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FOREWORD 


This  report  contains  the  lecture  notes  prepared  by 
Professor  Francis  D.  Murnaghan  for  use  in  the  Second  Annual 
Lecture  Series  on  Applied  Mathematics.  This  series  of  lectures, 
on  the  Calculus  of  Variations,  was  presented  by  Professor  Murnaghan 
in  the  spring  of  i960,  under  the  sponsorship  of  the  Applied  Mathe¬ 
matics  Laboratory,  David  Taylor  Model  Basin. 

Just  as  in  the  case  of  the  first  lecture  series  in  Applied 
Mathematics, on  the  subject  of  the  Laplace  Transformation, 

Professor  Murnaghan  has  made  a  unique  contribution  to  the 
instruction  of  applied  mathematics,  at  a  time  when  progress  in 
this  field  is  so  vitally  important  to  our  national  aims.  The  Applied 
Mathematics  Laborptory  is  proud  to  make  these  lectures  available 
in  report  form. 
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Technical  Director 
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ABSTRACT 


These  lectures  on  applied  mathematics  are  devoted  to  the  Calculus 
of  Variations, with  especial  attention  to  the  applications  of  this  subject  to 
mechanic®  and  wave -propagation.  Our  aim  has  been  to  give  more  than  a 
superficial  account,  treating  only  extremal  or  stationary  curves,  and  in 
order  to  keep  the  treatment  self-contained  and  free  of  formidaole  mathe¬ 
matical  difficulties, we  have  made  the  necessary  differentiability  assumptions 
which  are  usually  satisfied  in  the  applications  of  the  Calculus  of  Variations 
to  problems  of  nature.  We  have  emphasized  the  parametric  treatment  since 
this  simplifies  the  passage  from  the  Lagrangian  to  the  Hamiltonian  point  of 
view  or  vice  versa.  The  distinction  between  extremal  and  minimal  curves 
is  clearly  explained,and  Legendre  s  condition  for  a  minimal  curve  is  proved 
Extremal  fields  and  the  Hilbert  Invariant  Integral  are  treated  tn  some  detail, 
the  essential  distinction  between  plane  problems  and  problems  in  space  of 
three  or  more  dimensions,  as  far  as  extremal  fields  are  concerned,  being 
carefully  explained.  The  Weierstrass  E-functiin  is  treated  and  the 
connection  between  the  Calculus  of  Variations  and  the  Rayleigh  quotients 
and  the  method  of  Rayleigh-Ritz.  so  important  ir.  vibration  problems,  is 
clearly  shown.  In  the  treatment  of  wave-propagation  the  analogue  for 
earthquake  waves  of  Snell  s  law  of  refractun  is  given  The  lectures  close 
with  a  short  account  of  multiple  integral  problems  and  with  a  discussion  of 
the  useful  maximum-minimum  principle, which  we  owe  t  j  Courant 

Care  has  been  taken  to  make  the  veatment  self-c  >ntained,and  details 
of  the  proofs  of  the  basic  mathematical  theorems  are  given 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  1 

The  Lagrangian  Function  and  the  Parametric  Integrand 

The  problems  of  the  calculus  of  variations  which  we  shall  treat 
in  these  lectures  belong  to  one  or  the  other  of  two  types.  The  simplest 
example  of  the  first  of  these  two  types  may  be  stated  as  follows: 

Given  two  points  in  a  plane,  or  in  3-dimensional  Euclidean  space, 
does  there  exist  a  curve  of  shortest  length  connecting  these  two  points  ; 
and,  if  so,  is  this  curve  unambiguously  determinate?  We  shall  refer  to 
the  problems  of  the  calculus  of  variations  which  belong  to  the  type  to 
which  this  problem  belongs  as  problems  of  Type  I .  An  example  of  the 
second  of  the  two  types  may  be  stated  as  follows: 

Given  a  closed  curve  in  3-dimensional  Euclidean  space,  does 
there  exist  in  this  space,  a  surface,  having  this  curve  as  its  boundary, 
whose  area  is  least  and,  if  so,  is  this  surface  unambiguously 
determinate?  We  shall  refer  to  the  problems  of  the  calculus  of 
variations  which  belong  to  the  type  to  which  this  problem  belongs  as 
problems  of  Type  2. 

Problems  of  the  calculus  of  variations  of  Type  1  are  curve  problems 
where  the  curves  with  which  we  are  concerned  may  be  plane  curves  or 
space  curves  or,  indeed,  curves  in  a  space  of  any  number  of  dimensions 


The  problem  of  the  motion  of  a  mechanical  system  may  be  conveniently 

stated  as  a  problem  of  Type  1  of  the  calculus  of  variations.  If  the 

mechanical  system  has  n  degrees  of  freedom,  n  =  1,  2,  .  ,  we 

write  its  generalized  coordinates  as  a  column,  or  n  x  1  ,  matrix  x  , 

so  that  the  j  th  coordinate  j  =  1  ,  .  ,  n  is  denoted  by  xJ  The 

velocity  n  x  1  matrix  x{  is  the  derivative  of  x  with  respect  to  the 

time  t  so  that  the  j  th  element  of  x(  is  x^(  ,  j  =  l,  ...  ,  n, 

and  the  Lagrangian  function  L  =  T-V  of  the  system  is  a  function  of 

the  two  n  x  1  matrices  x  and  x^,  and  also,  if  the  mechanical 

system  is  nonconservative,  of  the  time  t.  Here,  T  is  the  kinetic 

energy  and  V  is  the  potential  energy  of  the  mechanical  system.  We 

assume  that  L  is  a  continuous  function  of  the  matrices  x,  xt>and  also, 

if  the  system  is  nonconservative,  of  t  ;  by  the  statement  that  L  is 

a  continuous  function  of  the  two  matrices  x  and  x{  we  mean  that  L 

j  k 

is  a  continuous  function  of  the  2n  elements  x  ,  x  t,  j  =  1 . . 

k  *  1,  ...  ,  n,  of  these  matrices.  If,then,  x  =  x(t),  a^t^b,  is  any 
smooth  curve  C  in  the  n-dimensional  coordinate  space  of  the 
mechanical  system,  so  that  x  and  x{  are,  along  C,  continuous 
functions  of  t  over  the  interval  a  ^  t  ^  b  ,  the  Lagrangian  function 
L  is,  along  C,  a  continuous  function  of  t  over  the  interval 

a  <  t  <  b ,  and  we  may  consider  the  integral 

b 

1=  f  Ldt 

i 


2 


0 


This  integral  is  a  number  whose  value  depends  on  and  is 
unambiguously  determined  by  the  curve  C  just  as  the  length  of  a 
rectifiable  curve  in  a  plane,  or  in  3-dimensional  Euclidean  space, 
depends  on  and  is  unambiguously  determined  by  the  curve.  The 
motion  of  the  mechanical  system  is  such  that  its  paths,  i.e. ,  the 
curves  x  =  x(t),  a  <  t  ^  b  ,  which  describe  this  motion  are  such 

rt 

that  the  integral  I  -  I  Ldt  has,  when  evaluated  along  any  one  of 
these  paths,  a  stationary  value  (without  being,  necessarily,  a 
minimum  or  a  maximum).  Thus  the  integral  I  plays,  for  the 
mechanical  system,  the  role  played  by  the  length  of  a  rectifiable 
curve  in  the  introductory  example  we  have  given  of  problems  of 
Type  1  of  the  calculus  of  variations, but  there  is  one  essential 
difference:  I  has,  when  evaluated  along  a  path  of  the  mechanical 
system,  merely  a  stationary  value  while  the  length  integral  is  actually 
a  minimum  when  the  curve  of  integration  gives  it  a  stationary  value. 

In  this  connection  we  observe  that  the  time -coordinate  space  of  a 
mechanical  system  is  what  is  called  a  numerical,  rather  than  « 
metrical,  space;  the  concept  of  distance  between  two  of  its  points  is 
not  defined, so  that  it  does  not  make  sense  to  speak  of  nearby  points 
of  this  coordinate  space  nor  of  the  length  of  a  curve  in  it.  We  may. 


however,  endow  this  coordinate  space  with  a  metric  and  we  shall  do 
this  by  assigning  to  it  the  ordinary  Euclidean  metric  in  accordance 
with  which  the  distance  between  any  two  points  |  *  j  and  J^.J  is  the 
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«. 


magnitude  of  the  matrix  |x,_x|. 

In  order  to  treat  most  simply  the  problem  of  Type  1  of  the 
calculus  of  variations  which  is  furnished  by  the  motion  of  a  mechanical 
system  it  is  convenient,  particularly  when  the  system  is  nonconservative 
so  that  the  Lagrangian  function  L  involves  explicitly  not  only  the  two 
n  x  1  matrices  x  and  xt  but  also  the  time  variable  t  ,  to  place 
this  variable  on  an  equal  footing  with  the  n  elements  of  the  coordinate 
matrix  x.  To  do  this  we  replace  our  n-dimensional  coordinate  space 
by  a  (n  ♦  1)  -  dimensional  time-coordinate  space.  We  denote  n  ♦  1  by 
N  and  introduce  the  N  x  1  matrix  X  =  |  ^ !  whose  first  element  is  t 
and  whose  remaining  n  elements  are  those  of  the  n  x  1  matrix  x. 

A  smooth  curve  in  the  N-dimensional  time-coordinate  space  is  furnished 
by  a  formula  X  =  X(T),  a  4  B,  where  T  is  any  convenient  independ¬ 
ent  variable,  or  parameter,  of  which  the  N  x  1  matrix  X(  )  is  a 
continuously  differentiable  function.  If  we  choose  T  to  be  t  itself, 
the  first  of  the  N  equations  implied  by  the  formula  X  =  X(T)  is  simply 
t  *T  but,  in  general,  this  equation  will  be  replaced  by  t  =  t(T),where 
t(T)  is  either  smooth,  i.e. ,  possesses  a  continuous  derivative  or  is  at 
least  piecewise  smooth  over  the  interval  a  ^  T  B .  By  the  words 
piecewise  smooth, we  mean  that  t(D,  while  continuous  over  a  <  T  <  B, 
may  fail  to  be  differentiable  at  a  finite  number  of  interior  points  of  this 
interval;  at  each  of  these  points  it  possesses  a  right-hand  and  a  left- 
hand  derivative  but  these  derivatives  are  not  equal  At  all  points 
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where  t  is  defined  it  is,  by  hypothesis,  continuous.  We  make  one 
further  restriction  on  the  function  t  =  t(T);  namely,  we  assume  that 
t  >  o  at  all  points  where  t^.  is  defined.and  this  implies  that  at  each 
of  the  finite  number  of  points  at  which  t^  is  possibly  undefined 
the  right-hand  and  left-hand  derivatives  of  t  with  respect  to  7  are 
non-negative  since  the  right-hand  derivative,  for  example,  of 
t  at  T=  7j,  say,  is  the  limit,  as  (5-*  0  through  positive  values 
of  t j.(Tj  ♦  6).  The  reason  for  this  restriction  is  as  icilows:  We 
regard  t,  which  was  the  parameter  or  independent  variable  used 
by  Lagrange,  as  a  master  or  control  parameter  and  we  do  not  wish 
any  other  parameter  T  to  sometimes  increase  and  sometimes 
decrease  as  t  increases.  Thus  we  do  not  wish  y  to  change  sign 
over  the  interval  a  4  t.  We  could  satisfy  this  wish  by  requiring 
that  tr<  0  instead  of  0 ,  but,  since  a  mere  change  of  sign  of 
T changes  the  inequality  0  into  t^  0  there  is  no  real  loss  of 
generality  in  requiring  that  t  be  >  0  at  aU  the  points  of  the  interval 
a  <  T  ^  B  at  which  it  is  defined.  We  assume,  further,  that  the  number 
of  points,  if  any,  of  the  interval  a  4  T  4  ®  at  which  y  =  0  is  finite. 

When  we  pass  from  the  master  parameter  t  to  any  other  allowable 
parameter  T  by  means  of  a  formula  t  *  t(T),  a  4  T  4  ® integrand 
of  the  integral  I  is  changed  from  the  Lagrangian  function  L  to  the 

product  of  L  by  t—: 

b  7  A 

Is  Ldt  *  ^  b  * 
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We  denote  this  new  integrand  by  F  and  we  refer  to  F  as  the 

parametric  integrand,  the  original  integrand  L  being  termed 

the  nonpara  metric  ,  or  Lagranglan ,  integrand  of  our  problem  of 

Type  I  of  the  calculus  of  variations.  Under  an  allowable  change 

of  parameter  T  — > T 1  furnished  by  a  formula  T  =  T(T  ), 

L  remains  unaffected  while  t^«*  t  /  *  t^.7^/,  where 

Jj  is  positive  save,  possibly,  at  a  finite  number  of  points  of  the 
T 

interval  *  •  Since  the  value  of  an  integral  is  insensitive 

to  clauses  of  the  integrand  at  a  finite  number  of  points,  we  shall 
refer  to  T  /  as  positive  (rather  than  nonnefative  ).  Then  the  trans- 

T  I 

formation  T-+T  of  the  independent  variable,  or  parameter, 
induces  the  transformation  F-»F/  =  F  T  ,  of  the  integrand  of  our 
integral  I.  This  clange  of  the  integrand  is  necessary  to  ensure 
the  invariance,  or  lack  of  dependence  upon  the  particular  parameter 
adopted,  of  the  Integral  I  itself.  F  is  a  function  of  the  two 
N  *  1  matrices  X  and  and,  under  the  change  of  parameter 
7--»7  ,  the  second  of  these  is  multiplied  by  the  positive  factor  1^.1 
while  the  first  is  insensitive  to  this  change  of  parameter  Thus, 
k  being  any  positive  number,  the  parametric  integrand  FfX.X^J 
is  such  that 


F(X,kX^J  *  kF(X,Xr) 

We  express  this  Important  quality  of  the  parametric  integrand  F 
by  the  statement  that  F  is  a  positively  homogeneous  function,  of 
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degree  1,  of  the  N  x  1  matrix  Xj. 

Example. 

Denoting  the  rectangular  Cartesian  coordinates  of  a  point  in 

1  2 

3-dimensional  Euclidean  space  by  <t ,  x  ,  x  ), the  formula  which 


furnishes  the  arc -length  I  of  any  smooth  curve  in  this  space  is 

=  jl  +  (xlt)2  ♦  (x2t)2f  2  Under  the  change  of 


where  L 


parameter  t— >T  this  appears  in  the  form  I=J  F  d T, 


where  F  =  L  t  j  * 


vc  -  i-M’ • 


2  )2!1/2.  Thus  F  is  the 


magnitude, 


of  the  3  x  1  matrix 
Observe  that  F.  while  a  positively 


homogeneous  function,  of  degree  1,  of  X^,is  not  a  homogeneous 
function,  of  degree  1,  of  X  ;  when  is  multiplied  by  a  negative 

number  k,  F  is  multiplied  by  -k  =  jkj  . 

We  shall  assume  from  now  on  that  the  Lagrangian  function 


L  is  a  continuously  differentiable  function  of  the  N  x  1  matrix 
X  and  of  the  n  x  1  matrix  xt<  The  derivative  of  L  with  respect 
to  the  n  x  1  velocity  matrix  x(  is  a  1  x  n  matrix  which  is 
known  as  the  Lagrangian  momentum  matrix  and  which  is  denoted 
by  p  .  Similarly  the  derivative  of  the  parametric  integrand  F  , 
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which  is  a  continuously  differentiable  function  of  the  two 

N  x  1  matrices  X  and  with  respect  to  the  N  x  1  matrix 

T 

Xy,  is  a  1  x  N  matrix  P  ,  which  we  term  the  parametric 
momentum  matrix  In  both  of  these  differentiations  the 
N  x  1  matrix  X  is  supposed  to  be  held  fixed  The  first  element 
t  of  Xy  appears  in  both  the  factors, 

L  =  L  (X,  x  )  =  L  lx,  5J  ™d  V>  of  F  and  so 

*  l  4  /  ' 


T 


P'  =  W 


*  L 


Now  L 


v  L*t  ^ 


=  -pxt 


so  that 


P,  =  L  -  p», 


The  remaining  elements  of  Xy  namely,  the  elements  of  x^., 
appear  only  in  the  first  factor  L  of  F  and  so 


V'V'JV  1  =  2  N 

SL 


=  p  ,  i  .  t_  =  p  , 
J-lr-  T  ]‘l 


In  words:  The  first  element  of  the  1  x  N  parametric  matrix  P 
is  found  by  subtracting  from  the  Lagrangian  function  L,  the 
product  of  the  n  x  1  velocity  matrix  x{  by  the  1  x  n 
Lagrangian  momentum  matrix  p;  and  the  remaining  N  -  1  =  n 
elements  of  the  parametric  momentum  matrix  are  those  of  the 
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Lagrangian  momentum  matrix 

Exercise  1  Show  that  the  elements  of  the  parametric  momentum 


matrix  are  positively  homogeneous  functions  of  degree  aero,  of 
the  Nxl  matrix  Xj.  Hint:  Differentiate  the  relation 
r  (  X,  kXr)  =  kP  (  X,  Xr)  With  respect  to 

Exercise  1  Show  that  P  X^.  =  F  Hint:  Differentiate  the  relation 
F  (X,  kXy.)  =  kF  (  X  Xj.)  with  respect  to  k  and  then  set  k  =  1 
Exercise  3  Show  that  if  L  =  T-V  where  T  is  a  homogeneous 
function  of  degree  2  of  the  nxl  velocity  matrix  x(,  and  V  does 
not  involve  xt,  then  Pj  =  -  (T  *  V)  Hint .*  p  =  ,  pxt  =  2T, 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  2 

Extremal  Curves;  The  Euler- Lagrange  Equation 

The  Lagrangian  function  L  of  a  problem  of  Type  1  ^of  the 
calculus  of  variations  is  a  function  of  the  N  x  1  matrix  X  and  of 
the  n  x  1  matrix  We  introduce  the  (N  +  n)  x  1  =  (2n  +  1)  x  1 

matrix  z  whose  first  N  elements  are  those  of  X  and  whose 
last  n  elements  are  those  of  x^and  we  regard  the  elements  of  z 
as  the  coordinates  of  a  point  in  a  space  of  2n  ♦  1  dimensions.  This 
space,  which  is  known  as  the  state  space  of  the  mechanical  system 
whose  Lagrangian  function  is  L ,  is,  like  the  time-coordinate  space 
of  the  system,  a  numerical  rather  than  a  metrical  space.  We 
endow  this  numerical  state-space  with  a  Euclidean  metric  and  we 
consider  a  region,  i.e.  ,  an  open,  connected  (2n  ♦  1)  -  dimensional 
domain  D,  in  this  (2n  ♦  1)  -  dimensional  state-space  over  which 
L  =  L(z)  is,  by  hypothesis,  a  continuously  differentiable  function 
of  z  -  If  X  =  X(t),  a  t  <  b,  is  any  smooth,  or  piecewise-smooth, 
curve  C  in  the  time -coordinate  space,  each  point  of  C  at  which 
C  is  smooth  furnishes  a  point  of  the  state-space,  the  first  N 
elements  of  the  corresponding  matrix  z  being  those  of  X(t) 
and  the  last  n  elements  of  z  being  those  of  x(.  The  various 
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points  of  the  state-space  which  we  obtain  in  this  way  constitute  a 
curve  which  we  term  the  image  of  C  in  the  state-space.  We 
shall  confine  our  attention  to  those  piecewise-smooth  curves  C 
in  the  time-coordinate  space  whose  images  !  in  the  state-space 
are  covered  by  the  region  D  over  which  L  =  L  (z)  is,  by 
hypothesis,  a  continuously  differentiable  function  of  z. 

When  our  problem  of  Type  1  of  the  calculus  of- variations 
is  presented  parametrically,  our  integral  I  appears  as 

ffl 

1=1  F  (X,  XJ  d  T 

Ja  1 

the  curve  C  in  the  N-dimensional  time-coordinate  space  along 
which  I  is  evaluated  being  furnished  by  equations  of  the  form 
X  =  X(T) ,  a<T  <  B  . 

The  point  z  of  our  (2n  +  1)  -  dimensional  state-space, which  is 
furnished  by  any  point  of  C  at  which  C  is  smooth, has  as  its 
first  N  coordinates  the  N  elements  of  X(f)  and  as  its  last 
n  coordinates  the  n  mutual  ratios  of  the  N  elements  of 
XT(T),it  being  assumed  that  X^T)  is  not  the  zero  N  x  1 
matrix  save,  possibly,  at  a  finite  number  of  points  of  the  interval 
a  <  x  <  fl.  Thus  z  is  independent  of  the  particular  parameter 
chosen  to  describe  the  curve  C  of  integration.  The  parametric 
integrand  F(X,Xr  )  is  a  positively  homogeneous  function  of 
degree  1  of  the  N  x  1  matrix  Xj.  ,  and  the  value  of  the 
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integral  I  is  independent  of  the  parameter  adopted  to  describe  C. 

In  order  to  gain  some  idea  as  to  how  the  integral  I  varies  when 

the  curve  C  along  which  it  is  evaluated  is  varied,we  consider  the 

following  1  -  parameter  family  of  piecewise-smooth  curves 

C  :  X(T  s)  -  X(  T)  *  sfl  ' ).  a  ^  B  ;  -  C  <•  s  ^ 
s 

Here  s  is  the  parameter  which  names  the  various  members  of  the 

I  -  parameter  family, and  we  suppose  that  s  varies  over  a  closed 

interval  which  is  centered  at  s  =  0.  We  observe  that  reduces  to 

C  when  s  =  0,and  we  express  this  fact  by  the  statement  that  we 

have  imbedded  C  in  the  1  -  parameter  family  of  curves  Cg . 

f(T)  is  any  convenient  N  x  1  matrix  which  is  piecewise-smooth 

over  a  <  T  <  fi  and  which  reduces  to  the  zero  N  x  1  matrix  when 

T=a  and  when  '=6,  so  that  all  the  curves  of  our  1  -  parameter 

family  have  the  same  end-points.  suppose  that  the  various  images 

r  in  the  state-space,  of  the  curves  C  of  our  1  -  parameter 
s’  = 

family  are  all  covered  by  the  region  D  of  cur  state-space  over 
which  L  is,  by  hypothesis,  a  continuously  differentiable  function  of  t. 
Since 

Fx  =  t- 4;  P1  ■  L-f“f  Vi-i' 2 . 

it  follows  that  F  is,  over  D ,  a  continuously  differentiable  function 

/X  i 

of  the  2N  x  1  matrix  Z  MXj  whose  first  N  elements  are  those 

\ 

of  X  and  whose  last  N  elements  are  those  of  Xr  .  When  I  is 
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evaluated  along  any  curve  Cg  of  our  1  -  parameter  family, its 
value  is  a  function  of  s  which  is  furnished  by  the  formula 

I(s)  =  |  F  (X(T)  *  s fir),  Xj.Cn  *  Sf  <T))dT  . 

I(s)  is,  for  each  value  of  s  in  the  interval  -(5<  s  ,  a 
differentiable  function  of  s  ,  us  derivative  being 

rt 

I  (s)  =  Fv(x(T)  -  sf(T),  X^<7)  *  sfJ~))f<r)<iT 
s  'a  * 

*  J  Bp (X(T)  -  s Hr),  xr(T)  *  s un)  fj~) d~ 

CM 

We  denote  by  f  I  the  differential  of  I(s)  at  s  =  0  and  we  term  C  I 
the  variation  of  I.  Thus  JI  is  the  product  of  Ig(f))  by  ds, where 
ds  is  an  arbitrary  number  (which  we  may  take  to  be  1).  Similarly, 
we  denote  by  X  the  differential,  with  respect  to  s,  of  X(7s) 
at  s  =  0  .and  by  6x^  the  differential,  with  respect  to  s, 
of  Xr(7,s)  at  s  =  0  so  that 

J  X  =  ds. fD ;  6x^=ds.fr(r) 

We  observe  that 

(fx),  = 

( 

and  we  express  this  result  by  the  statement  that  the  order  of 
variation  and  differentiation  with  respect  to  7  is,  when  these 
operations  are  applied  to  X(  T,  s),  immaterial.  In  this  notation, 
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then,  we  have 

<5i »  J6jr3t(x(r),  x_<n)  fx.p(xn,  xjn)  f  xf|dr 

In  order  to  simpUfy  this  expression  we  observe  that 

P  (X(f),  X  ;T))  is  continuous  over  a  <  T<  fi  save,  possibly, 

X  T 

for  a  finite  number  of  points,  namely,  the  points  which  furnish  the 
points  of  C  at  which  xTis  not  defined.  Thus  the  1  x  N  matrix 

function  _ 

G  n-J  FX(X(T),  X^(T))d 

is  defined  and  is  continuous  at  all  points  of  the  interval  a  <  B  . 
If  r=  T  is  a  point  of  the  interval  a<T<B  at  which  xt  is  not 
defined, both  G  -o)  and  G(yo)  exist  and  are  equal,  their 

common  value  being  G<  ^).  Furthermore,  at  those  points  of  C 
at  which  xT  is  defined,  Gr  =  Fx  so  that  dG  =  d  r.  Gr  =  dT.  Fx . 
Thus  the  matrix  product  Fx(xO,  Xf(T))  5x  may  be  integrated 
by  parts  to  yield 

/fl{Fx(x(r),  x^)'5ijdr-(^!a  |c<'!x)r^dr 

a  ' 

and,  since  5x  =  ds.f(T)  is  zero,  by  hypothesis,  at  r=a  and 
at  T~  fl,  this  reduces  to  G(5x)^|dr.  Hence,  since 

^Xr  =  ((5x)r,  (3 1  appears  as 

-et  r  ; 

£l  =  ;<P  -  G)(CX),  >df 
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It  is  clear  that  81  will  be  zero  for  all  allowable  choices  of  HT) 
y;  tbe  1  x  N  matrix  P  -  G  Is  coostaat  along  C  ;  Indeed,  if  this 
is  the  case,  £  |(P  -  G)(6X>r  }dT  is  of  the  form 

N  fi  H  ifl 

E  A.  f  (8XJL.dr  *  Z  A.  6XJ  ,  where  A  ,  H,  Is 

j=l  i  h  T  j=l  1  « 

constant,  and  this  is  zero  since  6X*  *  ^(D  ds,  j  =  1 . N,  is 

zero  by  hypothesis,  at  f  =  a  and  at  7*  A.  This  sufficient 
condition  for  81  to  be  zero,  for  all  allowable  choices  of  f(f), 
is  also  necessary.  To  see  this  we  obsenre  that  81  may  be  written 
in  the  form  f* \(J>  -  G  -  A)(8X)^  d  7.  where  A  is  any  constant 
1  x  N  matrix,  and  we  consider  the  1  z  N  matrix  function, 

(p  -  G  -  A)  d  T  .  It  is  clear  that  H(«)  =  0,  and  we  may 

rB 

determine  A  by  means  of  the  formula  (A  -  o)A  =  J  (P  -  G)d  T 


so  that  H(fi)  -  0,  H(7)  is  piecewise-smooth  over  a£T4B, 

its  derivative  at  any  point  T  of  this  interval,  which  furnishes  a 
point  of  C  at  which  C  is  smooth,  being  P  -  G  -  A.  Hence  we 
may  take  as  our  N  x  1  matrix  HT)  the  transpose  H*(f) 
of  H(7) ,  and,  when  we  do  this,  (OX)^.  =  ds.  (P  -  G  -  A)  at  all  the 
points  of  C  at  which  C  is  smooth.  Thus 

51  =  ds  ^6|(P  -  G  -  A)(P  -  G  -  A*  jdT  is  the  proAict  by  ds  oi 
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the  integral  along  C  of  the  squared  magnitude  of  the  1  x  N  matrix 
P  -  G  -  A  ,and  so  for  51  to  be  zero,  P  -  G  -  A  must  be  zero  at 
all  the  points  of  the  ;nterval  a  4  T  4  6  at  which  it  is  continuous. 

Thus  we  have  *he  following  important  result- 

The  necessary  and  sufficient  condition  that  51,  when  evaluated 
along  C  ,  be  zerc  for  all  allowable  choices  of  the  N  x  1  matrix  f(f), 
is  that  P  -  G  *  A  at  all  the  points  of  C  at  which  C  is  smooth,  the 
1  x  N  matrix  A  being  constant  along  C. 

Since  G (T)  is  differentiable,  with  derivative  F„,  at  all  the 
points  of  the  interval  a  <  T  4  fi  which  furnish  smooth  points  of  C 
it  follows  that  at  all  smooth  points  of  C, 


This  equation  is  known  as  the  Euler- Lagrange  equation  for  problems 
of  Type  1  of  the  calculus  of  variations.  We  term  any  piecewise-smooth 
curve  along  which  it  holds  an  extremal  curve  of  F  or  of  the  Lagrangian 
function  L.  The  laws  of  mechanics  for  systems  which  possess  a 
potential  energy  function  may  be  stated  as  follows: 

The  paths  or  curves  in  the  time-coordinate  space,  traced  by 
the  mechanical  system  are  extremal  curves  of  the  Lagrangian  function  L, 
or,  equivalently,  of  the  parametric  integrand  F. 

In  differentiating  the  relation  P  =  G  *-  A  ,  ;n  order  to  obtain 
the  Euler- Lagrange  equation  Pr  -  F^>  v-  have  lost  the  fact  that  A  is 
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cons-ant  along  C  ,  since  the  relation  Pf  =  Fx  would  hold  if  A 

were  merely  piecewise -constant  along  C.  If  Tj  is  a  point  of  the 

interval  a  <  B  which  furnishes  a  point  of  C  at  which  x^  is 

net  defined, we  know  that  G^-O)  =  GfTj+O)  and  this  implies  that 

P(^-O)  =  P(Tj  *0/.  The  parametric  momentum  matrix  P(T) 

is  not  defined  at  bu»,  on  assigning  to  it  at  \  the  common 

value  of  P<  Tj-0)  and  P<  7^-0),  we  see  that  it  is  defined  and 

continuous  at  all  the  points  of  the  extremal  curve  C.  This  is  a 

remarkable  fact  since  the  velocity  matrix  xt  is  not  defined  at 

the  points  of  C  at  which  C  fails  to  be  smooth.  The  last  n 

coordinates  of  P  are  those  of  the  Lagrangian  momentum  matrix 

p  =  L  which  we  may  regard  as  a  function  of  xt ,  the  N  x  1 
xt 

matrix  X  being  held  fixed.  Let  us  denote  by  the  n  x  1 

matrix  which  is  the  transpose  p*  of  the  lagrangian  momentum 
matrix  p  and  let  us  suppose  tha*  Lx*  is,  over  the  region  D  of 

our  (2n  *  1)  -  dimensional  state -space,  a  continuously  differen¬ 
tiable  function  of  the  n  x  1  velocity  ma’rix  xt  Then  the 

Jacobian  matrix  L  •  of  p*  -*  Lx*  with  respect  to  xt  is  a 
*t  t  * 

symmetric  nxn  matrix  of  which  the  element  in  the  J  th  row 

,  .  ,  •*,  u  ik=l  n.  If  this  matrix  is 

and  k  th  column  is  L  j  k  ,  J,  *  4  *  •  •  * 

t  t 
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nonsingular  over  D,  the  relation  p  =  L  defines,  over  D,  x 

t  1 

as  a  function  of  X  and  p,and  so  the  relation  p(  7^-0)  =  p(  7^+0) , 

which  is  a  consequence  of  the  relation  P(T^-O)  =  P(T^+0),  forces 

the  equality  xt(  7^-0)  =  x{(  7^+0) .  In  words: 

Any  extremal  curve  C  of  L  is,  when  the  n-dimensional 

matrix  L  •  exists  and  is  continuous  and  nonsingular  over  D , 
xt  Xt 

not  merely  piecewise -smooth  but  actually  smooth,  x^  existing  and 

being  continuous  at  all  the  points  of  C  . 

If  we  assume,  in  addition,  that  the  n  x  1  matrix  L  t 

Xt 

is,  over  D,  a  continuously  differentiable  function  of  the 
(2n  +  1)  x  1  matrix  z  =  j*  j  and  not  merely  of  the  n  x  1 

matrix  x^ ,  then  the  theory  of  implicit  functions  assures  us  that 

the  function  xt(X,p)  of  X  and  p  which  is  defined  implicitly  by 

the  formula  L  «  =  p*  is  a  continuously  differentiable  function  of 
xt 

the  (2n  +  1)  x  1  matrix  |  p*|’  Since  this  (2n  +  1)  x  1  matrix 
is  continuously  differentiable  along  C,  it  follows  that  x^  is 
continuously  differentiable  along  C  so  that  xtt  exists  and  is 
continuous  at  all  the  points  of  C  .  In  words: 

Any  extremal  curve  C  of  L  is,  when  the  n  x  (2n  +  1) 

matrix  L  *  exists  and  is  continuous  over  D  (its 

xt  z 
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n-dimensional  sub-matrix  L  *  being  nonsingular  over  D) 

t  t 

not  merely  smooth  but  possessed  of  continuous  curvature  x 

’  tt 

being  defined  and  continuous  at  all  the  points  of  C. 

Along  C  the  parametric  momentum  matrix  P  is  continuously 
differentiable, its  derivative  being  furnished  by  the  Euler- Lagrange 
equation,  P  =  .  The  last  n  of  the  equations  furnished  by 

this  1  x  N  matrix  equation  may  be  written  as  p^  =  L  or, 
equivalently,  on  taking  the  transpose  of  this  1  x  n  matrix 
equation,  as 

Lx«xt  xtt  *  Lx*X  Xt  =  Lx* 

This  nxl  matrix  equation,  which  furnishes  xtt,  along  any 
extremal  curve  of  L,  as  a  function  of  the  (2n  +  1)  x  1  matrix 
,  is  the  Euler- Lagrange  equation  for  extremals  which 
possess  continuous  curvature.  We  shall  from  now  on  suppose 
that  L  possesses  the  following  two  properties  which  guarantee 
that  all  its  extremals  possess  continuous  curvature: 

1)  exists  and  is  continuous  over  D. 

2)  L  «  is  nonsingular  over  D. 

xt  xt 
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Exercise  1.  Show  ‘.hat  if  F  or,  equivalently,  L  does  not  involve 
any  given  one  of  the  elements  say,  of  X  then  the  corresponding 

element  P.  of  the  parametric  momentum  matrix  is  constant  along 
■»ny  extremal  curve  C  of  F  . 

Exercise  2.  Show  that  the  Principle  of  Conservation  of  Energy  holds 
for  any  mechanical  system  whose  Lagrangian  function  does  not 
involve  i  explicitly.  Hint :  P^  =  -  (T  +  V).  Note.  In  view  of  the 
res  lit  <  '  this  exercise, a  mechanical  system  whose  Lagrangian 
function  does  not  involve  t  explicitly  is  termed  conservative. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  3 

Lagrangian  Functions  Which  Are  Linear  in  x^ 

When  defining  an  extremal  curve  C  of  a  given  Lagrangian 
function  L  we  imbedded  C  ,  which  we  assumed  at  the  beginning 
to  be  merely  piecewise-smooth,  in  a  t-parameter  family  of  curves 
X(f,  s)  =  X(T)  ♦  sf  (T)  ,  a  fi,  -  6  ^  s<  6 
where  f  (T)  is  any  convenient  N  x  1  matrix  which  is  piecewise- 
smooth  over  a^r^8  and  which  vanishes  at  T  -  a  and  at  T  -  B  . 
We  found  out  later  that  if  L  satisfies  some  not  very  restrictive 
conditions,  C  must,  if  it  is  to  qualify  as  an  extremal  curve  of  L, 
be  smooth  and,  in  addition,  possess  continuous  curvature.  Despite 
this  fact.it  is  convenient  to  permit  the  comparison  curves 
X  =  X(T,  s) ,  a  *  r<  B  ,  -  6  ^  s  4  5  ,  to  be  only  piecewise-smooth 
so  that  f(T)  may  fail  to  be  differentiable  at  a  finite  number  of 
points  of  the  interval  a  <  6 .  We  do  not  impose  add.tional 

restrictions  upon  our  extremal  curve  if  we  imbed  it  in  a  family, 

X  =  X{T,  s),  -Ks<5,  which  does  not  involve  the 

parameter  s  linear ly^  provided  that  the  N  x  1  Matrix  function 

X(T,  «)  ia  auch  that  lXr  =  (flX)  or,  equivalently,  that  the  two 

<  7 
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mixed  second-order  derivatives  X_  and  Xg  r  exist  and  are  equal 
save,  possibly  for  a  finite  number  of  points  of  the  interval  a  ^  6 

This  wili  certainly  be  the  case  if  these  derivatives  exist  and  are 
continuous  over  the  rectangle  a  (  v  6,  •  5  \  s<  5,  with  the 
possible  exception  of  a  finite  number  of  values  of  ^<e  may  also 
imbed  our  extremal  curve  C  in  a  k  -  parameter  family  where 
k  is  any  integer  ^  1  In  this  case  s  is  a  k  x  1  matrix,  the 
parameter  matrix  of  the  family,  and  X^,g  ,  X s^.  are  N  x  k 
matrices  which  we  assume  to  exist  and  be  continuous  over  the 
(k  ♦  1)  -  dimensional  interval  a  T4  B  -6  s^  5*  1  =  1  .  .  k  • 

This  assumption  guarantees  the  equality  of  these  two  N  x  k  matrices, 
the  possible  existence  of  a  finite  number  of  exceptional  values  of  I 
being  always  allowed  for .  We  assume  that  when  s  is  the  kxi  zero 
matrix, the  curve  X  =  X(  T,  s) ,  K  B,  of  our  k  -  parameter 
family  reduces  to  the  curve  C  in  which  we  are  interested,  so 
that  X(  f,  0)  =  X(T) ,  a  4  6  .  Xg  is  now  a  Nxk  matrix  and 

the  variation  6X  of  X  is  defined  by  the  formula 
5X  =  Xs(T,  0)  ds 

where  ds  is  an  arbitrary  k  x  1  matrix  Our  formula  for  51 
has  precisely  the  same  form  as  before;  namely, 

fiI  =  J*  lFx(x(n'  xr(n)5x*  p(x(r),  xr(r))fixr|dr 
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and  the  right-hand  side  of  this  equation  is  of  the  form 

6  I  *  .  *  6.  I  where  5.  I ,  for  example,  is  furnished  by  the 

1  K  * 

formula 


V 


(X(T),  Xr(T);  ix(T) 


P  (  X(T) ,  Xr(7))(f1(T))r 


where  J)  =  X  i  ( r,  0) .  61  will  be  zero  for  an  arbitrary  choice 


of  ds  =  ,  :  ,  if,  and  only  if,  each  of  the  partial  variations 

WJ 

3  I ,  ...  ,  6.1  is  zero, and  each  of  these  partial  variations  will  be 

1  lv 

zero,  by  the  argument  given  in  Lecture  2,  for  arbitrary  piecewise- 

smooth  Nxl  matrices  /jd) . fk(D,  all  of  which  vanish 

at  a  and  at  T=  fi  if,  and  only  if,  the  Euler- Lagrange  equation, 
Pr  =  Fx  ,  holds  along  C 

Amongst  the  extremal  curves  of  L  or  of  F  are  to  be  found, 
if  any  such  exist,  the  minimal  curves  and  the  maximal  curves  of  L. 

A  piecewise -smooth  curve  C  in  the  N  -  dimensional  time- 
coordinate  space  Is  said  to  be  a  minimal  curve  of  L  if  the  inequality 

I  (0)  <  I  (s) 

Is  valid,  no  matter  what  is  the  family  of  piecewise -smooth  curves 
in  which  C  is  imbedded,  provided  only  that  the  parametric  interval 
.  «  s  <  6  (or  -ak  s1  C  8* .  j  =  1  ,  . .  .  k  .  if  the  family  is  a 
k  -  parameter  one)  is  sufficiently  small.  It  follows  from  the  theory 
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of  maxima  and  minima  of  functions  of  one  or  more  riables  that  a 
necessary  but  by  no  means  a  sufficient,  condition  for  C  to  be  a 
minimal  curve  of  L  is  that  C  be  an  extremal  curve  of  L  Fora 
maximal  curve  of  L  the  inequality  I  (0)^  I  is)  is  reversed  to 
read  1(0)  >I(s),  and  it  follows  that  a  maximal  curve  of  L  is 
a  minimal  curve  of  -  L  There  is,  then,  no  loss  of  generality  in 
confining  our  attention  to  minimal  curves.and  we  shall  do  this. 

Since  everv  minimal  curve  of  L  ,  if  any  such  exists,  is  an 
extremal  curve  of  L  we  know  that  if  L  possesses  the  two 
properties  described  in  Lecture  2;  namely, 

1)  L  m  exists  and  is  continuous  over  D, 

-f* 

2)  L  *  l s  nonsingular  over  D, 

*t  Xt 

then  every  minimal  curve  of  L  which  is  covered  by  D  is  not 

only  smooth  but,  in  addition,  possesses  continuous  curvature. 

The  question  of  the  existence  of  minimal  curves  of  L  is  a  difficult 

one,  compared  to  which  the  question  of  the  existence  of  extremal  curves 

of  L  is  relatively  superficial.  For  mechanical  systems  L  is  always 

a  quadratic  function,  not  necessarily  homogeneous  of  the  elements  of 

x.  the  coefficients  of  this  quadratic  function  depending  in  general 
^  * 

on  the  Nxl  time -coordinate  matrix  X.  V»e  continue,  however, 
to  use  the  symbol  L  to  stand  for  the  nonparametnc  integrand  of 
any  problem  of  Type  1  of  the  calculus  of  variations  whether  or  not 
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this  problem  is  one  connected  with  the  motion  of  a  mechanical  system, 
and  we  proceed  to  discuss  a  problem  where  L  has  a  particularly 
simple  form: 

L  is  a  linear  function  of  x 


Writing  L  =  rx  q ,  where  r  »s  a  l  x  n  matrix,  we  assume 
that  the  1  x  N  matrix  R  =  (q,r)  is  a  continuously  differentiable 
function  of  the  N  x  1  matrix  X  =  J  over  some  region  of  the 
N  -  dimensional  time -coordinate  space  which  covers  the  curves  C 
in  which  we  are  interested.  Here  F  =  RX^  so  that  P  =  R. 

Taking  the  transpose  of  this  1  x  N  matrix  equation, we  have 

*  *  •  •  • 

P  =  R  so  that  P  f  -  R  .where  R  ^  is  the  N  -  dimensional 

matrix  of  which  the  element  in  the  j  th  row  and  kth  column  is 

•  • 

(R  )  k  f  ],  k  =  1 . N.  Writing  F  in  the  form  X^  R  we  have 

J  X^ 

•  •  |  •  • 

Fv  =  X._  R  v  or,  equivalently,  F_*  =  |R  Y(  X  ,  and  so  the 

A  7  A  A  ,  A,  7 

Euler- Lagrar.ge  equation  of  our  problem  of  Type  1  of  the  calculus 
of  variations  appears  in  the  form 


sxT  -  0 

where  S  is  the  alternating  N  -  dimensional  matrix 

If  S  is  nonsingular.  X^  must  be  the  zero  N  x  1  matrix  and  so 

our  problem  does  not,  m  general,  possess  extremals.  Observe  that 

L  •  *  r  so  that  L  *  Is  the  zero  n-dimensional  matrix;  if, 

*t  XtXt 
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then,  L  does  possess  an  extremal, this  extremal  need  not  possess 
continuous  curvature  nor  even  be  smooth. 


If  the  matrix  S  is  identically  zero  over  a  region  of  our 
N-dimensional  time-coordinate  space  which  covers  the  curve  C 
in  which  we  are  interested, the  Euler -Lagrange  equation 
SX  0  becomes  vacuous  so  that  any  piecewise -smooth  curve 

which  is  covered  by  this  region  is  an  extremal  curve  of  L  The 

element  m  the  ]  th  row  and  the  k  th  column  of  S,  ]  ,  k  -  1  , .  .  •  ,  N 
,s  (n  )  .  ^  and  so  S  is  identically  zero  when,  and  only 

when,  R  is  the  gradient  of  a  differentiable  point  function  0(X) 
over  a  region  of  our  time -coordinate  space.  When  this  is  the  case, 

f  =  (frad0) X  =  0^  and  l  -  f  0  d 0(X(B) )  -  0(X(a) ) 

T  i  Jn  * 


are  the  same  for  all  the  curves  of  the  family  in  which  we  suppose 
C  to  be  imbedded.  Thus  I(S)  -  1(0)  =  0  so  that  not  only  is  every 
piecewise -smooth  curve  which  is  covered  by  the  region  of  our  time 
coordinate  space,  over  which  we  suppose  S  to  be  the  zero 
N-dimensiona'  matrix,  an  extremal  curve  of  L  but  every  such 
curve  is  at  once  a  minimal  and  a  maximal  curve  of  L. 


Exercise 

Show  that  any  two  Lagrangian  functions,  and  Lj,  whose 
difference  is  of  the  form  rx^  *  q,  where  R  ^q,  r)  is  the 
gradient  of  a  differentiable  point  function  over  a  region  of  the 
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N -dimensional  time-coordinate  space  possess  the  same 
extremal  curves  over  this  region.  Note.  It  follows  from  the 
result  of  this  exercise  that  if  we  are  only  interested  in  the 
extremal  curves  of  L  ,  we  may  neglect  in  L  all  terms  of 
the  type  rxt  ♦  q  where  R  =  (q  ,  r)  =  grad<£  (X)  .  In  particular, 
we  may  neglect  all  terms  of  the  type  rx(  -  q  ,  where  the 
1  x  n  matrix  r  and  the  point  function  q  are  constants. 
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Lectures  on  Applied  Mathema’ics 
The  Calculus  of  Variations 
Lecture  4 

The  Legendre  Condition  for  a  Minimal  Curve 


We  assume  that  our  Lagrangian  function  L  =  Liz) 
possesses,  over  a  given  region  D  of  the  i2n  ♦  1)  -  dimensional 
state -space, the  following  properties 

1)  L  is,  over  D  a  continuously  differentiable  function 

of  the  i2n  *  1)  x  1  matrix  z  = 


X 

x. 


2)  The  n  x  i2n  *  1)  matrix  L 


exists  and  is  continuous 


over  D  and  the  n  -  dimensional  sub- matrix  L  %  of  this 

t  xt 

n  x  (2n  ♦  l)  matrix  is  nonsmgular  o*er  D 

Then  any  piecewise -smooth  curve  C  in  'he  N  -  dimensional 
time -coordinate  space  whose  image  I  in  the  <2n  ♦  1)  -  dimen¬ 
sional  state-space  is  covered  by  D  .  is  an  extremal  curve  of  L 
if,  and  only  if. 


1)  C  is  smooth  and  po**esses  continuous  curvature, 

2)  C  satisfies  'he  Euler- Lagrange  equation  p  -  L^ 


or,  equivalently,  p  =  L^*  ;  i.e. , 
Lx*xt  *n  *  Lx*X  Xt  =  Lx* 


This  Euler- Lagrange  equation  defines  x  ,  along  C,  as  a 
continuous  function  of  z  -  x^l.and  if  this  function  of  z  is 
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not  only  continuous  but  continuously  differentiable  the  existence 
and  uniqueness  theorem  for  ordinary  differential  equations 
assures  us  that  there  passes  through  each  point  of  D  an 
unambiguously  determinate  image  P  of  an  extremal 
curve  C  of  L  We  shall  then,  m  order  to  avail  ourselves 
of  this  uniqueness  theorem,  make  the  following  additional 
assumption  concerning  the  Lagrangian  function  L 

3)  L  «  -  p*  possesses,  over  D,  con' mucus  second- order 
x  t 

derivatives  with  respect  to  the  elements  of  the  t2n  •  1)  x  1 
matrix  z,and  is,  over  D,  a  continuously  differentiable 

function  of  z 

We  know,  then,  that  there  passes  through  each  point  of  D  an 
unambiguously  determinate  curve  which  is  the  image  of 

an  extremal  curve  C  of  L .  This  curve  C  is  not 
necessarily  a  minimal  curve  of  L  ,and  we  proceed  to  investi¬ 
gate  conditions  which  are  imposed  on  L  along  C  by  the 
requirement  that  C  be  a  minimal  curve  of  L  This  investi 
gation  of  necessary  conditions  on  L  along  C  for  C  to  be  a 
minimal,  and  not  merely  an  extremal,  curve  of  L  is  a 
preliminary  one.  our  ultimate  goal  is  the  procurement  of 
conditions  on  L  which  are  sufficient,  t  e  strong  enough,  to 
ensure  that  the  given  extremal  curve  C  of  L  will  be 
actually  a  minimal  curve  of  L  It  may  well  happen  that  the 
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necessary  conditions  which  we  ob’ain  ma\  not  bo  strong  enough 
to  be  sufficient  and,  conversely,  it  may  well  happen  that  the 
sufficient  conditions  which  we  obtain  may  b*  *oo  strong,  in  the 
sense  that  they  are  not  necessary  The  ideal  would  be  a  set  of 
conditions  on  L  which  are  at  once  necessary  and  sufficient  to 
ensure  that  a  given  extremal  curve  C  of  L  should  be  a 
minimal  curve  of  L;  but  we  shall  not  achieve  this  ideal  in 
these  lectures. 

Note:  The  n  equations  implied  by  the  n  x  1  matrix  equation, 

L  x  ♦  L  *  „  X  L  *.art  the  last  n  of  the  N  equations 
x;  xf  tt  x*  X  t  x 

« 

implied  by  the  Nxl  matrix  equation  P_  Fx*  ,  or, 

equivalently,  Fx*x  X _ ♦  Fx«  XX,  Fx*  »  on  the 

assumption  that  t  t(T)  possesses  a  continuous  second 
derivative  over  a  C  .  The  question  naturally  arises 

What  about  the  first  of  these  N  equations'1  The  fact  that  F 
is  a  positively  homogeneous  function,  of  degree  1,  of  X  . 
assures  us  that  PX^  -  F  and  on  differentiating  this  equation 
with  respect  to  T  along  C,  we  obtain 
p^X  *  PXrr  F^  *  PXr  ;  or,  equivalently, 

(p  -  F  )  X  -  0  •  If,  then,  the  last  n  of  the  N  equations 

I  X 

implied  by  the  matrix  equation  P^  -  Fx  are  satisfied, 
so  also  will  be  the  first  (since  tf  '  0  at  ail  but  a  finite  number 
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of  points  of  the  interval  n'vfl  and  since  and  Fx  are 

continuous  over  this  interval). 

The  first  necessary  condition  we  shall  determine  which  an 

extremal  curve  C  of  L  must  satisfy  in  order  that  it  may 

qualify  as  a  possible  minimal  curve  of  L  concerns  the 

„  -  dimensional  matrix  La,  which  we  have  already  assumed 

*  t  t 

to  be  continuously  differentiable  and  nonsingular  over  D.  This 
condition,  known  is  the  Legendre  condition,  states  that: 

The  quadratic  form  u  Lx*x  u,  where  u  is  any  real 

n  x  1  matrix  of  unit  magnitude,  must  not  be  negative  at  any  point 
of  C . 

This  condition  is  what  we  may  term  a  local  condition;  it  concerns 
the  nature  of  the  n  -  dimensional  matrix  Ly*  ^  at  a  given 

(but  arbitrarily  chosen)  point  of  C  .  To  prepare  for  the  proof 
of  the  Legendre  condition,  which  proof  we  shall  complete  in  the 
next  lecture,  let  4  be  any  interior  point  of  the  interval  *  ^  b 
and  let  the  equations  of  C  ,  in  terms  of  the  Lagrangian  parameter  t , 

X  =  X(t) ,  a  <  t  <  b 

Since  we  are  concerned  only  with  the  behavior  of  L%#  ^  at 

the  point  X(tj)  of  C,we  consider  a  positive  number  5’  which  is 

so  small  that  the  closed  interval  4  -  6*  <  t  <  4  ♦  8'  is  covered 
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by  the  interval  a  ■>  t  b,and  we  denote  by  s  any  positive 
number  x  .  Let  f  (t)  be  the  nx  1  matrix  function  of  t  that 
is  defined  as  follows: 

/  (t)  =  o  if  a  ^  t  ^  tj  -  s'  and  if  tj  *  s’  <  t  ^  b 
f  (t)  =  (t  -  tj  ♦  s’)  u  if.  tj  -  s '  ^  t  •„  tj 

f  (t)  =  (tj  *  s'  -  t)  U  If  tj  ‘  t  tj  -  s’ 

where  u  is  any  given  constant  n  x  1  matrix  of  unit  magnitude, 

so  that  u*  u  -  1  .  Then  f  (t)  fails  to  be  differentiable  at  the  three 

points  tt  -  s'  ,  tj  ,  t1  *  s'  of  the  interval  a  t  ^  b  and  the 

various  curves  C  ,  of  the  2  -  parameter  family 
s,s 

X  =  X  (t,  s,  s')  -  X(t)  *  sf  (t),  avUb,  •5'sS45,-{,n<s''  5  '  - 

are  piecewise*smooth  over  a  <-  t  ^  b  .  The  curves  of  this  family  for 

which  s  -  o  coincide,  no  matter  who*  is  the  value  of  s’  in  the 

interval  •5%s'^5',  with  our  given  extremal  curve  C  and, 

when  s*o,  the  curve  C_  ,  coincides  with  C  save  for  the 

interval  -  s*  <:  t  <  ♦  s f  ,  which  interval  we  may  take  to  be 

as  small  as  we  find  convenient.  All  the  images  g  s»  of  the 

curves  C.  g,  are  covered  by  D  if  5  is  sufficiently  small, 

but  we  shall  need  in  our  argument  a  bounded  and  closed  subset  of 

D  which  covers  all  the  images  [  g  g,  of  the  curves 

C  and  more  than  this,  a  sufficiently  small  neighborhood  of 

s.s’ 

each  point  of  each  of  these  images.  The  reason  for  this  need  is 
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that  we  shall  have  to  avail  ourselves  of  the  fact  that  any  function 


which  is  continuous  over  a  bounded  and  closed  point-set  is  bounded 
over  this  point-set;  our  region  D  is  open  and  we  have  no  assurance 
that  a  function  which  is  continuous  over  D  is  bounded  over  D. 

In  order  to  construct  the  bounded  and  closed  subset  of  D  which 
we  require, we  first  observe  that  there  is  no  lack  of  generality  in 
taking  D  to  be  bounded;  for  the  image,  H  of  C  is  bounded  and 
we  may  replace  D,  if  it  is  unbounded,  by  the  intersection  of  D 
and  any  open  (2n  ♦  1)  -  dimensional  interval  which  covers  T. 

The  points  of  the  various  images  lg  g,  of  the  curves  Cg  g, 
of  our  2  -  parameter  family  constitute  a  closed  subset  of  D, 
since  the  three  intervals  a  <  t  b,  -  5  <  s^6,  -  5' 4  s'  4  5 


are  all  closed,and  we  denote  by  d  any  positive  number  which  is 
less  than  one-half  the  distance  of  this  closed  subset  of  D  from 
the  closed  boundary  D'  of  D.  Finally,  we  denote  by 


the  closed  subset  of  D,  which  consists  of  those  points  of  D  whose 
distance  from  D’  is  d  Since  the  distance  of  any  point  of  any 
one  of  the  images  :  s  from  D  is  -  2d  ,  it  is  clear  that  all 
the  images  ■"*  .  are  covered  by  Drf  ;  the  distance  of  any 

point  of  our  (2n  *  1)  -  dimensional  space,  whose  distance  from 
any  point  of  any  one  of  the  images  1  s  s*  is  <  d,  is  by  virtue 
of  the  triangle  inequality  •  2d  -  d  -  d,and  so  covers  any 
d- neighborhood  of  any  point  of  any  one  of  the  images  f  ,  . 
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If  C  is  to  qualify  as  a  minimal  curve  of  L.the  difference 
Ifs,  s')  -  l(0,s  )  must  be  m.nne^ative  provided  only  that  5  is 
sufficiently  small,  1(0,  s')  bein£,  no  matter  what  the  value 
of  s’,  the  integral  of  L  along  C  This  difference  is  the 
integral  of  L(X,  xt)  -  L(X,xt)  over  the  interval 
t  -  s’  v  tj  •  t ^  *  s',  and  we  write  L (X,  xj  -  L (X, x{)  in  the 
form  L..  *^2  ,  where 

A,  =  LlX.x,)  -  LlX.x,) 

■2  =  LlX.x,)  -  L(X,x() 

t,  *  s’  t.  -  s’ 

Thus  "  1  x  f1  \ 

Ks, s')  -  1(0, s’)  =  ls  dt  -  ,  ~^2  dt 

'  h  ‘ s’  ri  ’  s 

and  we  shall  treat, in  detail,  in  our  next  lecture  the  two 
integrals  on  the  right-hand  side  of  this  equation. 
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Lectures  on  Applied  Mathem-.ucs 
The  Calculus  of  Variations 
Lecture  5 

Proof  of  the  Legendre  Condition 


We  have  seen  that  the  difference  I(s,  s’)  -  1(0,  s’)  between 

the  integrals  of  L  =  L(z)  evaluated  along  the  curves  C(s,s') 

and  C(0,  s')  =  C  is  the  sum  of  the  two  integrals 
t  -*“S ' 

;  l 

I  =  dt  :  Aj  =  L(X,xt)  -  L(X,xt) 


Vs’ 


A2  dt  ;  A2  =  L(X,xt)  -  L(X,xt) 


Vs 


and  we  now  proceed  to  appraise,  one  after  the  other,  these  two 
integrals.  We  have  endowed  our  (2n  *  1)  -  dimensional  state- 
space  with  a  Euclidean  metric. and  the  distance  between  the  two 


X  X  ' 

points  -  and  - 
I  xt  xt 


cf  this  space  is  s  t  -  t,  -  s’  , 


which  is  4  i  5’  .  If,  then,  5  is  sufficiently  small,the  point 

is  a  point  of  the 


x  i  _X 

-  of  our  state-space  is,  since  z  =  - 

\l  '  t 


image  T  ,  of  C  ,  ,  covered  by  the  point-set  Drf 
s, s  s, s 


of 


this  state -space  as  is,  also,  every  point  of  the  line  segment  which 

X 


x 

connects  the  two  points  J  and  r 

1  \ 


\  Xtl 


of  the  state-space. 
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The  Theorem  of  the  Mean  of  differential  calculus  assures  us 

_  1 

that  =  L^fX* ,  xt)  (X  -  X) ,  where  z’  =  -  is  some  point 


of  the  line  segment  which  connects  the  two  points 


_  and,  since  L_  is,  by  hypothesis,  continuous  over  D 

*t  1  * 

it  is  bounded  over  the  closed  subset  of  D;  in  other 

words,  there  exists  a  positive  constant  M  which  is  such 
that  the  magnitude  of  the  l  x  N  matrix  Lxlx'  ,  xt)  IS  <.  M 
for  all  values  of  t  in  the  interval  tj  •  s' v  t<  tj  ♦  s'.  Since 

the  magnitude  of  the  N  x  1  matrix  X  -  X  is  ^  5  5 ' ;  it 

follows  that  the  absolute  value  of  ,  the  product  of  the  second 

of  these  two  matrires  by  the  first,  is  v  M  !  !'  and  so 

v  2  M  3  V  s’ 

Turning  now  to  the  integral  , we  observe  that,  by  virtue  of 
the  Extended  Theorem  of  the  Mean  of  differential  calculus, 

A2  -  p(X,  xt)  (5t*xt)  ♦  |(xt-xt)  Lx,  x  (X,  v)  (YV 


where  X  is  some  point  of  the  line  segment  which  connects 

\  v  I 

the  two  points,  X  |  and  X  of  our  state-space.  The 
xt  I  t 

distance  between  these  two  points  is  and  so,  by  the  same 

argument  as  before,  the  point  v  is  covered  by  if  5  is 
sufficiently  small.  Over  the  interval  tj  -  s  i  t  ^  tj  , 
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xt  -  x{  is  the  consunt  n  x  1  matrix  5  u  while,  over  the 
interval  tj  t  t^  *■  s'  ,  it  is  the  constant  n  x  l  matrix  -s  u 
Hence,  the  integral  of  pL  (xt  -  xf) ,  where  denotes  the  value 


of  p  (X,  x  )  at  t  =  tj  ,  over  the  interval  tj  -  s'  <  t  <  tj  *  s' 


is  zero,  so  that 


Vs 


V9  ' 


(  p  (X,  xt)  (xt-xt) '  dt  ^  p(X,  xt)  -  Pj  j  (*t-xtJ  [  dt 


Vs  * 


By  virtue  of  the  continuity  of  the  1  x  n  matrix  p  along  C  ,  we 
know  that  the  magnitude  of  the  1  x  n  matrix  p(X,  x^)  -  Pj  is 

arbitrarily  small,  say  v  f,  over  the  interval  tj  -  s’  4  t^  tj  ♦  s 
if  ;  ’  is  sufficiently  small.  The  magnitude  of  the  nxl  matrix 
X  -  x{  is  5  over  this  interval  and  so  the  absolute  value  of 
the  matrix  product  p  (X,  xt)  -  p^  (x(  -  x^)  is  ^  t  5  over  the 


interval 


’  ntesjral 


tj  -  s'-  t-  t; 


/ 


Vs’ 


t  -s'  ^ 
1  s 


p  (X, 


♦  s’  .  Hence  the  absolute  value  of  the 

V  <W|  *  ls  ^  2  ^  5  s'  * 


where  -f  is  arbitrarily  small  if  the  positive  number  S'  is 
suffic’  ?ntly  small. 

Finally,  the  distance  between  the  two  points  z  =(  X  ]  and 

\  w 

X^l^  of  our  sute-space  :s  arbitrarily  3mall  over  the 
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interval  if  V  is  sufficiently  small,  by 

virtue  of  the  continuity  of  z  over  the  interval  Ht^'o. 

denote  by  J,  the  n-dimensional  matrix  L  ,  evaluated 
1  t  t 

at  t  -  t1  ,  and  we  suppose  that  there  exists  some  n  x  l  matrix 

u  of  unit  magnitude  which  is  such  that  u'  u  is  negative. 

Denoting  the  value  of  ^  u*  Jj  u  by  -k  so  that  k  •  o  we 
can  choose  5  and  V  sufficiently  small  so  that  the  absolute 
value  of  the  difference  between  the  tw  d  quadratic  forms 

1  u*  L  .  (X,  v)  u  and  J  u*  J  u  is  '  j  k  :  indeed, 

2  x*.  x.  z 


t  t 


X(tt) 


and 


X 

v 


of  our 


the  distance  between  the  two  points  xt(tj) 
state-space  is  arbitrarily  small  if  $  and  V  are  sufficiently 
small,  and  the  n-dimensional  matrix  ^  is,  by  hypothesis 

continuous  over  D.  Hence,  \  u-  L  x  (X,  v)  u  i.  negative  and 

2  t  t 

<  -  -  k  over  the  interval  t.  -  s’  ^  t  vj  tj  ♦  31  so  that 

3  1 


t.+s' 


1 

2  J 


1 


Vs' 


t  t 


u*  L  *  (X,  v)u|dt  is  negative  and  <  -  j  k  s’ 


Since  i  -  x  =  ♦  s  u  over  the  interval  tj  -  s’  *  t  <  ^  ♦  s’ . 
1  1  tj+  s' 


it  follows  that 

2 


Vs’ 


,|(V  vj* 


is  negative  and  <  -  \  k  s2  s'  Choosing  V  so  small, 
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5  and  s  remaining  fixed,  so  that  20<3ks  •  lt  {ollows 
ttot  I2  ..  negutiv*  and  <  -  j  k  s2s'.  No.  »e  ban  seen 

that  the  numerical  magnitude  crt  I ,  is  <  2  M  8  5  s  and’ 

choosing  5'  so  small,  S  and  s  remaining  fixed,  so  that 

2  M  a  S'  <  -  k  «2  ,  »e  see  that  I,  .  I2  is  negative,  being 
6 

<  .  i  k  s2  s’  Hence,  C  is  not  a  minimal  curve  of  L  and  we 
have  completed  the  proof  of  the  Legendre  condition: 

An  extremal  curve  C  of  L  ts  not  a  minimal  curve  of  L 
lf  the  quadratic  form,  \  u*  u  ,  evaluated  at  any  point  of 

C  ,  is  negative  for  any  n  x  1  matrix  u  of  unit  magnitude. 
Noter  In  our  argument  we  have  assumed  that  the  point  tj 


of  the  interval  a  <  t  <  b 
which  the  quadratic  form 


which  furnished  the  point  of  C  at 
I  u  •  L  u  was  evaluated, 

a  *Vt 


was  an  interior  point  of  this  interval.  However,  if  this  quadratic 
form  is  negative  at  one  of  the  end-points  of  this  interval,  it  is 
negative,  by  virtue  of  the  continuity,  along  C  of  the  n-dimensional 
matrix  L  ,  at  interior  points  of  the  interval  which  are 

xVt 

sufficiently  near  this  end  point. 

The  n  -  dimensional  matrix  x  ts  symmetric  and  so 

it  may  be  transformed  at  any  point  of  our  extremal  curve  C  by 

means  of  a  rotation  matrix  into  diagonal  form,  the  transforming 
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rotation  matrix  R  depending,  in  general,  upon  the  particular 
point  of  C  which  we  have  selected.  The  diagonal  elements  of  the 
diagonal  form  R*  x  R  of  x  >  which  are  all  real. 


t  t 


t  t 


are  the  characteristic  numbers,  Xy  -  ,  An,  of  x  » 

and  our  quadratic  form  ^  u*  L  m  u  appears  as 

L  *  t 


5  (Aj  (V1)  2  ♦  ... 


where  v\  ...  ,  vn  are  the  elements  of  a  n  x  1  matrix 
v  =  u  of  unit  magnitude.  Since  the  n  x  1  matrix  u  of 
unit  magnitude  is  arbitrary,  so  also  is  the  n  x  1  matrix  v  , 
u  being  R  v  ,  where  v  is  an  arbitrary  n  x  l  matrix  of  unit 
magnitude.  It  follows  that,  in  order  that  our  extremal  curve  C 
of  L  may  qualify  as  a  possible  minimal  curve  of  L ,  none  of  the 
characteristic  numbers,  ^  ,  -  •  •  ,  of  L  can  be  negative 

at  any  point  of  C  .  Indeed,  if  one  of  them.  I  say,  j  =  1  ,  .  .  -  ,  n  , 
is  negative  at  some  point  of  C  ,  we  need  only  set  all  the  elements  of 
v  ,  save  v^ ,  zero,  so  that  =  —  1,  to  obtain 


t  , 


A  ,  which  is  negative. 


1  u*  L  .  u  =  -  v*  R*  L,  R  v  =  «  , 

2  **txt  2  xtxt  2  ] 

Converse!^  if  all  the  characteristic  numbers,  » 


of  Lx*  x 
x  t  xt 


are  nonnegative  at  every  point  of  C, 


40 


-  u*  L  u  =  1  (A  fv1)2  *  -  *  A  (Vn)2  >  is  nonnegative  at  every 

2  X*t  Xt  2  L  1  n  i 

point  of  C  and  C  qualifies  as  a  possible  minimal  curve  of  L 
The  determinant  of  the  n  -  dimensional  matrix  L  m 

t  t 

is  the  product.  A,  ...  A  ,  of  its  characteristic  numbers  and  so, 

1  n 

if  L  *  is  nonsingular  along  C  (which  assumption  guarantees 
x  t  xt 

the  smoothness  of  C),  none  of  the  characteristic  numbers  of 

L  „  is  zero  at  any  oomt  of  C  In  order,  then,  that  C  may 
x  t  \ 

qualify  as  a  possible  minimal  curve  of  L  .  all  the  characteristic 

numbers  of  L  .  must  be  positive  (and  not  merely  nonnegative^ 
x*  x 
t  'c 


at  every  point  of  C  The  statement  that  all  the  characteristic 


numbers  of  L  ,  are  positive  is  equivalent  to  (that  is,  implies 

X  t  Xt 

and  is  implied  bvi  the  statement  that  the  quadratic  form 

1  u*  L  m  u  is  positive  no  matter  what  is  the  n  x  1  matrix  u 

2  X  t  Xt 


of  unit  magnitude  when  this  is  the  case  we  say  that  the 

n  -  dimensional  matrix  L  m  is  positively  definite  \fchen 

X  t  Xt 

L  m  is  positively  definite  along  C,  we  say  that  L  satisfies, 
x*t  Xt 

along  C  ,  the  strong  Legendre  condition,  as  opposed  to  this,  we  say 
that  L  satisfies,  along  C  ,  the  weak  Legendre  condition  when  we  are 
merely  assured  that  all  the  characteristic  numbers  of  L  . 
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are  nonnegative  at  every  point  of  C  .  In  other  words,  to  pass  from 
the  weak  to  the  strong  Legendre  condition, we  have  to  add  the 
requirement  that  the  n  -  dimensional  matrix  x^  be  non 


singular  at  every  point  oi  C  . 

The  characteristic  numbers  of  L>x#  x^  are  the  zeros-of  the 

polynomial  function  of  A,  of  degree  n,  det  *  Lx.^j  • 

where  E  is  the  n  -  dimensional  identity  matrix.  Fortunately, 
n 

however,  we  do  not  have  to  determine  these  zeros  in  order  to  decide 

whether  or  not  L.  is  positively  definite.  Indeed, 
x  txt 


where  I.  ,  j  =  I  »  •  •  •  > 
minors,  of  dimension 


A"-2 


♦  (-D  i 


n  ,  is  the  sum  of  the  various  principal 
j  ,  of  the  n  -  dimensional  matrix  Lx< 


x 


t 


For  example,  Ij  is  the  sum  of  the  diagonal  elements,  i.  e.  the 

trace,  of  L  ,  x  and  I  is  the  determinant  of  Lx  •  In 
t  t  1 

general,  I  is  the  sum  of  the  products  (j  at  a  time)  of  the 

,  \  1  n[  i  Therefore, 

n  characteristic  numbers  Aj,  ...  ,  Afi  oi  ux*t  x(  ‘ 

if  L  *  is  positively  definite  so  that  all  its  characteristic  numbers 

x*t  *t 


are  positive,  all  the  n  numbers,  Ij,  •••  >  !n<  are  positive 
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Conversely,  if  all  these  n  numbers  are  positive,  each  term  of  the 
polynomial  function,  An  -  Ij  A  ♦  •  •  •  ♦  ("1)  >  ^ias>  w^en 

A  is  negative,  the  same  sign.so  that  this  polynomial  function  has  no 
negative  zeros. 

Thus 

L  satisfies  along  C  the  strong  Legendre  condition  if,  and 
only  if,  all  the  n  numbers,  Ij  ,•••,!„ »  are  positive  at  each 
point  of  C. 

Example.  For  a  mechanical  system  with  n  degrees  of  freedom, 

L  is  of  the  form  T  (X,  x()  -  V  (X) ,  where  T  is  a  quadratic 
polynomial  function,  not  necessarily  homogeneous,  of  the  n  x  1 
velocity  matrix  x{ .  Writing  the  terms  of  T  which  are  of  the 
second  degree  in  the  elements  of  x^  in  the  form  j  ^  x^ , 
where  G  =  G  (X)  is  a  n  -  dimensional  matrix  point  function  in 
our  time-coordinate  space,  our  n  -  dimensional  matrix 

is  simply  G  ;  the  transpose  p*  of  the  1  x  n  momentum  matrix 
p  is  Gx(.  Thus  L  satisfies  the  weak  Legendre  condition 
along  C  if,  and  only  if,  all  the  characteristic  numbers  of  G  are 
nonnegative  at  each  point  of  C  ,  in  which  case  we  say  that  G  is 
positively  semi-definite  along  C  .  On  the  other  hand,  L  satisfies 
the  strong  Legendre  condition  along  C  if,  and  only  if,  G  is 
positively  definite  along  C.  If  n  =  1 ,  so  that  our  mechanical  system 
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has  but  one  degree  of  freedom,  G  is  a  1  -  dimensional  matrix, 
i.  e.  ,  a  point  function,  in  the  (t,  x)  -  plane,  and,  in  order  that  an 
extremal  curve  C  of  L  may  qualify  as  a  possible  minimal 
curve  of  L,  this  point  function  must  be  nonneuame  at  "very 
point  of  C 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  6 

Constrained  Problems,  The  Hamilton  Canonical  Equations 

We  now  suppose  that  the  curves  in  our  N  -  dimensional 

b 

time-coordinate  space  along  which  the  integral  I  =  L  dt 

-'a 

of  a  problem  of  Type  1  of  the  calculus  of  variations  is  evaluated; 
are  all  constrained  to  lie  on  an  n-dimensional  point  locus.  We 
term  a  problem,  of  Type  1  .  of  the  calculus  of  variations, where 
such  a  supposition  is  made,  a  constrained  problem.  We  shall 
suppose  that  the  equation  of  the  n  -  dimensional  point  locus  in  our 
N  -  dimensional  time-coordinate  space  is  of  the  form  f  (X)  =  0, 
where  the  function  f  is  continuously  differentiable  over  some 
convenient  connected  region  R  of  the  time-coordinate  space 
and  we  term  the  equation  f  (X)  =  0  the  constraint  imposed  on 
our  problem.  A  simple  example  of  such  a  constrained  problem  is 
the  following: 

2  12  2  2 

Gi,-en  two  points  on  the  sphere  t  ♦  (x  )  ♦  (x  )  =  1  in 
3  -  dimensional  Euclidean  space, does  there  exist  on  this  sphere 
a  rectifiable  curve  joining  these  two  points  whose  length  is  least, 
and,  if  so,  is  this  curve  unambiguously  determinate? 
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The  argument  for  a  constrained  problem  proceeds  as  in  the  case  of 
an  ordinary  ,  or  unconstrained,  problem  the  only  difference  being 
that  we  must  always  respect,  in  the  constrained  problem,  the 
constraint  f  (X)  -  0.  Confining  our  attention  to  smooth  extremal 

curves  of  L  and  writing  I  in  the  parametric  form, 

fi 

r 

I  F  d  7  we  have 


5  I 


J 


Fx5X.P6Xrjd 


where  the  parametric  momentum  matrix  P  =  Fx  is,  by 

virtue  of  our  hypotheses  concerning  the  Lagrangian  function 
L  =  L  (X,  xt),  continuously  differentiable  along  the  curve  C 
of  integration  so  that  the  term  P  8  Xy.  =  P  (6  X)^  may  be 
integrated  by  parts  to  yield,  since  8  X  is  zero,  by  hypothesis,  at 


fl 


r=  a  and  at  T*  fi,  J  (PSX^df  = 

/V 


.  f 

J 


e 


(Pj.  6X)  d  T .  Thus 


8  I 


-L 


s 


p-*Fx-'5X  dT 


We  cannot,  however,  conclude  that  the  necessary  and  sufficient 
condition  that  8  I  be  zero  for  arbitrary  allowable  variations  6  X 
of  the  curve  C  of  integration  is  that  P^  -  F^;  this  relation  is 
obviously  sufficient  but  it  is  too  strong,  i.  e. ,  it  is  not  necessary. 
Indeed,  these  variations  8  X  are  subject  to  the  relation  6X=  0, 
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since  the  family  1  c-mpa 


r i ■  j r-  cur  *-3  in  which  we  suppose  C  to 


be  imbedded  ail  lu.  by  hyp-thesis,  in  the  n  -  dimensional  point 

locus  j  iX)  h  If,  then,  A  is  an  undetermined  multiplier, 

!  e  ,  any  function  of  T  which  is  continuous  over  the  interval 
a  T ^  6  we  may  write  "  I  in  the  form 

‘■i  ■  j  |lpr  '  Fx‘;-,xl'xid' 

a  1 


and  the  relation  P.  --  Fx  >1  >x  n  sufficien',  no  matter  what 
the  undetermined  multiplier  A  for  the  vanishing  of  5  1  ,  for 
all  allowable  variations  6  X  of  the  curve  of  integration  We 
proceed  now  to  show  that  'his  condition,  with  an  appropriate 
determination  of  A,  is  also  necessary  for  the  vanishing  of  5  I 
for  all  allowable  variations  5  X.  We  first  observe  that  any 
point  of  our  n  -  dimensional  point  locus  f  (X)  *-  0  ,  at  which  fx 
1S  the  zero  1  x  N  matrix,  is  a  singular  point  of  this  point  locus 
and  we  assume  that,  if  any  such  singular  points  exist,  their 
number  is  finite.  If,  then,  we  can  prove  that  Pf  ♦  Afx  , 

with  an  appropriate  determination  of  A  ,  at  any  point  of  C  which 
is  not  a  singular  point  of  the  point  locus  f  <'X,  0  ,  it  follows 

from  the  continuity  along  C  of  both  sides  of  this  equation  that 
the  equation  continue  to  hold  at  those  points  of  C  ,  if  any  such 
exist,  which  are  singular  points  of  this  point  locus.  Let,  then,  \ 
be  a  point  of  the  interval  or  <  8  at  which  one  of  the  elements, 
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not  zero  and  let  us  set 


f  .  say,  of  the  1  x  N  matrix  f  „ 

A  ' 

all  the  elements  of  the  N  x  1  matrix  £X  zero  along  C  except 
‘X^  and  1  xk  where  k  is  any  one  of  the  number;,,  different  from 
j  of  the  set  1,  N  We  mav  assign  arbitrary  values  to 

5Xk  and  then  'X1  is  determined  at  any  point  where  fxJ  f  o  , 
by  means  of  the  equation  'XJ  ♦  o  We  now 

determine  the,  as  yet  undetermined  multiplier  \  so  that 
over  an  interval  T  -  5  ^  \  t  which  is  so  small  that 

/x,  is  different  from  zero  over  it  (this  being  always  possible  since 

f  is  different  from  zero  at  T  and  continuous  over  a  ^  r<  fl) 

X 

,p.)_  =  Fyj  ♦  A  /x j  :  then  5  I  reduces  to 

51  -  J,,  -  Fxk-Atx*V: dT 

on  the  understanding  that  £X^  and  'X^1  are  set  equal  to  zero  at 
those  points  of  the  interval  a  <  B  which  are  not  covered  by 

the  interval  -  |5"  ^  ^  ^  ^X*1  ^X*1 

were  not  zero  at  T  =  T  we  could  choose,  by  virtue  of  the  continuity 

of  this  expression  over  the  interval  a  ^  ^  6  ,  so  small  that 


ls  one-signed  over  the  interval 
T  -  a'  <  T<  Tj  ♦  5'.  All  we  have  to  do  to  ensure  that  '  I  is 
not  zero  is  to  choose  6X*  so  that  it  also  is  one-signed  over  the 
interval  T  -  fi*  <  T<  Tj  ♦  6’ .  Since  k  is  any  one  of  the  numbers 
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different  from  j 


of  the  set  l , 


N  it  follows  that,  at 


interval 


f  ,and,  since  Tj  1S  an>'  Point 
at  which  fx  is  not  the  zero  1  x  N  matrix, 


this  implies  that  the  equation 

pr  Fx‘lfx 

lS  valid  at  all  the  points  of  C  This  equation  is  the  Euler- 
Ugrange  equation  for  constrained  problems  of  Type  1  of  the 
calculus  of  variations  Strictly  speaking,  A  is  not  determined  at 
any  point  of  C  which  is  a  singular  point  of  the  n  -  dimensional 
point  locus  f  (X)  ^  0  ,  but  we  define  it  at  such  points,  if  any 
such  exist  on  C  ,  by  the  requirement  that  A  be  continuous  along  C 
Exampie.  The  constrained  arc -length  problem  In  N  -  dimensional 

Euclidean  space  ^ 

Here  F  =  (X^  *  2  .  P  *  T  ■  «*  Euler-Ugrange 


equation  is  -U  -  ^  =  A.fx  ■  Choosing  as  our  parameter 

F  F 


the  arc -length  s  along  C  ,  F  has  along  C  the  constant 
value  l  ,  and  the  Euler- Lagrange  equation  reduces  to  X*g  =Afx 
or,  equivalently,  Xsg=Afx.  •  When  f  (X)  is  \  (X*X  -  r2) , 
so  that  the  point -locus  whose  equation  is  f  (X)  =  0  is  a  sphere  of 
radius  r  with  center  at  the  origin,  our  Euler- Lagrange  equation 
IS  X  =  Ax  Thus  along  any  extremal  curve  of  this  constrained 
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arc-length  problem.  X*Xss  -  4.X*X  -  4  r2  Since  X'X^  -  0 

2 

along  any  curve  on  the  sphere  whose  equation  is  X*X  -  r  and 

since  X*  X  =  l  ,  X*X  *1=0  along  our  extremal  curve 
s  s  ’  ss 

so  that  our  undetermined  multiplier  \  has  the  constant  value 

-  m  this  problem  The  curvatur- vector  X..  has  the 

r 

direction  opposite  to  that  of  the  radius  vector  X  .  and  the 
magnitude  of  this  curvature  vector  being  the  product  of  the 
magnitude  r  of  the  radius  vector  by  the  absolute  value  of  A  ,  is 

-  .  From  the  relation  X  -  AX  we  obtain  X^  -  X1^  _  0’ 

f  ss  ss  ss  -  u 

where  ]  and  k  are  any  two  different  numbers  from  the  set 
1,  ...  ,  N,  and  so  XJXk.  -  xSf^  has  a  constant  value,  a]k  say, 

along  any  extremal  curve  This  implies  that  if  ],  k,  1  is  any 

triad  of  different  numbers  from  the  set  1,  .  .  ,  N  ,  then 

X*  ♦  akl  XJ  ♦  a1'  -  0  In  particular,  we  see  that  any  extremal 

curve  of  the  constrained  arc -length  problem  on  the  sphere 

a1)2 .  ex2.2  ♦  (X3)2  =  r2  ,  m  3-dimensional  Euclidean  space,  is 

an  arc  of  a  great  circle  of  this  sphere  obtained  by  intersecting  it 

with  a  plane,  a23  X1  ♦  a31  X2  ♦  a12  X3  =  0,  which  passes  through 

the  origin,  i.  e. ,  the  center  of  the  sphere. 

The  2n  x  1  matrix  X  is  commonly  termed  the  phase 

PI 

matrix  of  a  dynamical  system, and  we  shall  term  the  2N  x  1 
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matrm  £  the  extended  phase  matrix  of  the  system.  Regarding 
the  elements  of  the  2N  x  1  matrix  *  j  as  the  coordinates  of  a 
point  in  a  2N  -  dimensional  space,  which  we  endow  with  a  Euclidean 
metric,  we  refer  to  this  space  as  the  extended  phase-space  of  the 
mechanical  system  or,  generally,  of  any  problem  of  Type  l,  either 
unconstrained  or  constrained,  if  the  calculus  of  variations.  Now 
P  =  L  (X  x  )  -  p  x(  and  the  right-hand  side  of  this  equation  is, 
by  virtue  of  our  hypotheses  concerning  the  Lagrangian  function 
L  ,  a  continuously  differentiable  function  of  X  and  p  In  the  case 
of  a  conservative  mechanical  system,  for  which  T  is  a  homogeneous 
second-degree  function  \  x\  G  xt  of  the  velocity  n  x  1  matrix 
X[  whose  matrix  G  is  nonsingular  .  we  have  p  -  G  x{ 

so  that  x  -  G*1  p*  Hence  T  may  be  expressed  as  ^  p  G  p* 

and  P  -  -  -  p  G~l  p*  -  V  .  The  right-hand  side  of  this  equation 
1  2 

is  the  negative  of  T  ♦  V  ,  when  expressed  as  a  function  of  X 
and  p  :  we  denote  T  ♦  V  ,  when  expressed  as  a  function  of  X  and 
p,  by  H  (X,  p)  and  we  term  H  (X,p)  the  Hamiltonian  function  of 

the  conservative  mechanical  system.  Similarly,  for  any  problem 
of  Type  1  of  the  calculus  of  variations,  unconstrained  or  constrained, 
we  denote  pyL  (X,  x{) ,  when  expressed  in  terms  of  X  and  p , 
by  H  (X,  p)  and  we  term  H  (X,  p>  the  Hamiltonian  function  of  the 
problem.  Then,  ♦  H  (X,  p)  =  0  so  that  the  points  Z  =  (j) 
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of  our  extended  phase-space  are  constrained  to  he  on  the 
(2N  -  1)  -  dimensional  point  locus  £  (Z)  ■=  0  ,  where 

<£(Z)  *  Pj  ♦  H  (X,  p). 

Since  our  parametric  integrand  F  -  F  (X,  Xr>  is  a 
positively  homogeneous  function,  of  degree  1,  of  the  N  x  1 

matrix  X_  ,  we  have  F  =  FY  =  P  X^.and  so  our 

7  ^  r 

integral  I  =  I  Fd  ^  may  be  written  in  the  form 

I  * j  PdX 

We  may  regard  the  right-hand  side  of  this  equation  as  an 
integral  extended  along  a  curve  in  our  2N  -  dimensional 

extended  phase-space,  being  constrained  to  he  in  the 
(2N  -  1)  -  dimensional  point  locus  (Z )  -  0  in  this  space. 

Trie  integrand  F  =  P  Kj.  of  this  integral  is  a  positively 
homogeneous  function,  of  degree  1  ,  of  the  2N  x  l  matrix 

xrl 

Zr  -  P  ,  and  the  corresponding  parametric  momentum  matrix 

\  Tl 

is  the  l  x  2N  matrix  (P,  C  )  Since  =  (0,  X  „  ) ,  the 
Euler- Lagrange  equation  of  the  corresponding  constrained  problem 
of  Type  1  of  the  calculus  of  variations  is 


*  =  *HX:0  =Xr  ♦ 


J  X  ''“X  •  “  "r  ’  ^ 

The  first  of  the  last  N  of  this  set  of  2N  equations  is 
0  =  tj,*  A  and  the  remaining  n  oi  this  set  of  N  equations  are 
equivalent  to  the  matrix  equation  x T  ♦  )  Hp  -  0  .  Thus  the 
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BLANK  PAGE 


Euler- Lagrange  equation  of  our  constrained  calculus  of  variations 


problem  may  be  written,  since  Pj  =  -  H ,  as  follows 


=dt  =^k_  =  dH  =_^dr  ,  3f  k  =  1,  ...  ,  n 


H. 


T--Hxk  Ht 


These  equations  are  known  as  the  Hamilton  canonical  equations 
of  our  unconstrained  problem  of  Type  1  of  the  calculus  of 
variations. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  7 

The  Reciprocity  between  L  and  H  ;  The  Transversality  Conditions 

The  Hamiltonian  function  H  (X,  p)  of  an  unconstrained 
problem  of  Type  1  of  the  calculus  of  variations  is  connected 
with  the  Lagrangian  function  L  (X,  x{)  of  this  problem  by  means 
of  the  relation 

L  (X,  xt)  ♦  H  (X,  p)  =  p  xt 

and  the  symmetry  of  this  equation  indicates  that  the  relationship 

between  the  two  functions  L  and  H  is  a  reciprocal  one.  On 

differentiating  the  relation  just  written  with  respect  to  xt,  X 

being  held  constant,  we  obtain  p  ♦  H  ,  p*  =  p  ♦  x*  p*  ,  or, 

P  1 

equivalently,  H  ,  p*  =  x*  p*  (Note:  In  differentiating  the 
P  xt  1 

matrix  product  p  xt  with  respect  to  x^ ,  we  write  this  matrix 

product  in  the  equivalent  form  x*(  p*  when  differentiating  the 

matrix  factor  p,  in  order  to  encounter  the  n  -  dimensional 

matrix  p*  .  p  is  not  an  n  -  dimensional  matrix  but,  rather, 
xt  xt 

2 

a  1  x  n  matrix, and  the  matrix  product  p  x  does  not  exist. ) 

xt  1 
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p*  is  the  n  -  dimensional  symmetric  matrix  L  ,  which 
xt  *  t  *t 

we  have  assumed  to  be  nonsingular  over  the  region  O  of  our 

(2n  +  1)  -  dimensional  state-space,  and  so  the  equation 

H  *  p*  =  x*.  p*  implies  the  equation  H  „  =  x*  or, 
p*  t  x^  P  1 

equivalently,  the  equation 

VHP 

This  equation  plays,  when  we  start  with  the  Hamiltonian  function 

H  =  H  (X,  p)  the  role  played  by  the  equation  p  =  L  wjien  we 

t 

start  with  the  Lagrangian  function  L  =  L  (X,  xt) .  The 
n-dimensional  matrix  H  ,  is  the  Jacobian  matrix  of  xt 

r  r 

with  respect  to  p* ,  so  that  it  is  the  reciprocal  of  the  n  - 

dimensional  matrix  L,  ,  which  is  the  Jacobian  matrix 

t  xt 

of  p*  with  respect  to  xt .  The  N  x  1  matrix  X  is  held 
constant  when  calculating  these  Jacobian  matrices.  If,  then,  we 
start  with  the  Hamiltonian  function  H  =  H  (X,  p)  and  wish  to 
determine  the  corresponding  Lagrangian  function  L  =  L  (X,  xt) , 
we  assume  that  the  n  -  dimensional  matrix  Hp  p,  exists  and 
is  continuous  and  nonsingular  over  a  region  of  the  (2n  ♦  1)  - 
dimensional  |* |  -  space,  so  that  the  formula  xt  =  Hp  (X,  p) 
defines  p,  over  the  corresponding  region  of  the  (2n +1) - 
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dimensional  state-space,  as  a  continuous  function  of  the 
(2n  ♦  1)  x  1  matrix  z~  |*  J .  Then  L  (X,  x{)  is  furnished, 

over  this  region  of  the  state-space,  by  the  formula 
L  (X,  xt)  =  p  xt  -  H  (X,  p) . 

We  see,  then,  that  the  n  equations  furnished  by  the  matrix 

equation  xt  =  Hp  have  nothing  to  do  with  the  particular  smooth 

curve  x  =  x  (t) ,  a  <  t  <  b  in  our  time-coordinate  space.  They 

merely  restate  in  a  different  way  the  definition  of  the  Lagrangian 

momentum  matrix  p ,  which  was  originally  defined  by  means  of 

the  formula  p  =  L  Any  such  smooth  curve  has  an  image  T 
Xt 

in  the  2N  -  dimensional  extended  phase-space,  since  X  and  P 
are  defined  as  functions  of  T  over  the  parametric  interval 
a  <  6,  and  T  lies  in  the  (2N  -  1)  -  dimensional  point  locus 

whose  equation  is  0  (Z)  =  0 ,  where  Z  is  the  2N  x  1  matrix 

jpj  and0  (Z)  ■  Pj  ♦  H  (X,  p) .  Along  T,  we  have 
x  =  H  =  <D  and  this  equation  may  be  extended,  since  <!>„  =  1 , 

t  p  *p  *  ** j  ' 

to  read  X^  =  0p  .  In  other  words,  one -half  of  the  2N  first- 
order  differential  equations  of  any  extremal  curve  of  our  constrained 
problem  in  the  2N  -  dimensional  extended  phase-space  are  satisfied 
by  the  image,  in  this  space,  of  any  smooth  curve  in  our  N  - 
dimensional  time-coordinate  space.  We  proceed  to  show  that  the 
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other  half  of  our  2N  equations  are  satisfied  if,  and  only  if,  the 
curve  in  our  N  -  dimensional  time-coordinate  space  is  an 
extremal  curve  of  the  Lagrangian  function  L  of  our  unconstrained 
problem;  in  other  words,  they  are  equivalent  to  the  Euler- Lagrange 
equation  =  Fx*  Indeed.  8ince  F=Lt^.  this  Euler- 
Lagrange  equation  may  be  written  in  the  equivalent  form 
Pt  =  1^  and  on  differentiating  with  respect  to  X,the  relation 
L  (X,  xt)  ♦  H  (X,  p)  =  p  xt  =  x*t  p*  ,  xt  being  held  fixed,  we 

obtain  +  Hx  ♦  Hp,p*x  =  x*t  p*x-  This  implies,  since 

H  =  x*  that  L~.  =  -  Hv  .  Thus  our  Euler- Lagrange  equation 
is  equivalent  to  the  equation  =  -  Hx ,  and  this  may  be  written, 

since  Pj  =  -  H  (X,  p) ,  as  =  d  t ,  j  *  1,  . . . ,  n  . 

Xj  1 

Thus  we  have  the  following  important  result: 

The  images,  in  the  'fcN  -  dimensional  extended  phase-space, 
of  the  extremal  curves,  in  the  N  -  dimensional  time-coordinate 
space,  of  the  Lagrangian  function  of  our  unconstrained  calculus 
of  variations  problem  are  the  extremal  curves  of  the  Lagrangian 
function  P  X^.  of  the  corresponding  constrained  calculus  of 
variations  problem  in  the  2N  -  dimensional  extended  phase-space. 

Up  to  this  point  we  have,  when  defining  an  extremal  curve 
of  a  given  Lagrangian  function  L  =  L(z) ,  required  that  the  end- 


57 


points  A  =  X  (a)  and  B  -  X  (fi)  of  the  various  members  of  our 
family  of  comparison  curves  should  all  be  the  same.  We  now 
investigate  what  form  6  I  takes  when  these  end-points  vary, 
and,  to  indicate  this  variation,  we  write 

A  =  A  (s)  =  X  (a,  s)  ;  B  -  B  (s)  =  X  (B,  s) 

It  is  often  convenient  to  permit  the  parametric  interval 
a4.T^B  to  vary  with  s  so  that  a  and  B  are  functions, 
which  we  assume  to  be  continuously  differentiable,  of  the  parameter, 
or  parametric  matrix,  s  .  Then  6  1  appears  as 


6  I  »  F  (B)  6  B  -  F  (a)  6  a  + 


JB  |fx6X+  P  6  Xrj  dT 


where  F  (B)  =  F  (B,  X^(B)  )  and  F  (a)  -  F  (A,  X ^  (a)  )  .  If 
the  curve  of  integration  C  is  a  smooth  extremal  curve  of  L 
or,  equivalently,  of  F ,  the  parametric  momentum  matrix  P 
is  continuously  differentiable  along  C,  and  the  integrand 
P  6  X  -  P(6X)  may  be  integrated  by  parts  to  yield 


6 


Ja 
so  that 


Jp  6  X^JdT  =  P  (B)  6  X  (B)  -  P  (a)  6  X  (a)  -J  jp^  6  xJdT 


6  I  =  F  (fl)  6  fi  ♦  P  <B)  6  X  (6)  -  |f  (a)  6  a  ♦  P  (a)  6  X  (a)J 

■a 


(Pf-  FX)6X>  d  r 
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Since  the  curve  of  integration  C  is,  by  hypothesis,  a  smooth 
extremal  curve  of  L,  Pf  -  Fx  is  the  zero  lxN  matrix 
along  it, so  that  our  formula  for  6  I  reduces  to 

6  1=  |f  (B)  6  B  ♦  P  (B)  6  X  (B)j  -  |f  (a)  6  a  ♦  P  (a)  6  X  (a) 

This  may  be  simplified  further,  since  F  =  F  (X,  X^.)  is  a 
positively  homogeneous  function,  of  degree  1,  of  ;  thus 
F  (B)  =  P  (B)  X^B)  and  F  (a)  *  P  (a)  X^.  (a)  and,  since 

5  B  *  X  (B)  6  B  +  6  X  (fi)  and  6  A  =  X  (a)  6  a  ♦  6  X  (a) ,  we 

T  1 

haVG  6  I  *  P  (B)  5  B  -  P  (a)  6  A 

Wa  rningi  You  must  avoid  the  error  of  identifying  6  X  (a)  with 

6  A  or  6  X  (B)  with  6B.  A,  for  example,  is  the  value  of  X 
when  T  -  a  so  that  it  involves  the  parameter,  or  parametric 
matrix,  s  not  only  explicitly  but  also  implicitly  through 

a  -  a  (s) ;  thus  6  A  involves,  in  addition  to  the  term 
Xs  (o)  d  s  =  6  X  (a) ,  the  term  X^(a)  6  a  . 

In  order  that  the  arc  of  our  smooth  extremal  curve  of  L 
between  the  points  A  and  B  may  qualify  when  these  end¬ 
points  of  the  arc  may  vary  as  a  possible  minimal  curve  of  L , 
we  require  that  6  I  be  zero  for  all  permissible  variations  of  the 
end-points  A  and  B.  Holding  A  fixed  so  that  6  A  =  0  , 
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we  obtain  the  necessary  condition  P  tB)  6  B  =  0  ;  similarly, 

P  (a)  6  A  =  0 .  The  two  conditions 
P  (a)  6  A  0  ;  P  <B)  6  B  =  0 

which  are  necessary  in  order  that  our  smooth  extremal  curve 
of  L  may  qualify  (when  its  end  points  may  vary)  as  a  possible 
minimal  curve  of  L  ,  are  known  as  the  Transversality 
Conditions. 

Example.  For  the  arc-length  problem  in  N  -  dimensional  space, 

F  =  Jx^X^j172  ,  so  that  P*  =  is  the  N  x  1  matrix,  of 

unit  magnitude,  which  furnishes  the  direction  of  the  curve  X  =  X  (T) , 
<*4  T4&,  In  ‘he  sense  in  which  T  is  increasing.  Since  Fx 

is  the  zero  lxN  matrix,  P  or,  equivalently,  P*  is 
constant  along  any  extremal  curve  of  F  and  the  transversality 
conditions  express  the  fact  that  for  any  extremal  curve  to  qualify 
(when  its  end  points  may  vary)  as  a  possible  minimal  curve  of  F, 
the  extremal  must  be  perpendicular  to  all  possible  directions  in 
which  either  of  the  two  end-points  may  move.  For  example,  in 
the  arc -length  problem  in  the  plane  (if  the  end-points  are  required 
to  lie  one  on  each  of  two  nonconcentric  circles)  any  extremal  curve, 
which  is  necessarily  a  line-segment,  must  pass  (if  it  is  to  qualify 
as  a  possible  minimal  curve)  when  produced,  if  necessary, 
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through  each  of  the  two  centers  of  the  twcf  given  circles.  This 
necessary  condition  is  not  sufficient;  there  are  four  line-segments 
on  the  line  joining  the  two  centers  which  have  one  end-point  on 
one  of  the  two  circles  and  the  other  end-point  on  the  other 
Of  these  four  line-segments  only  one  is  a  minimal  curve, the 
other  three  being  neither  minimal  nor  maximal  curves. 

We  now  consider  an  n  -  dimensional  locus  of  initial  points 
A  in  our  N  -  dimensional  time-coordinate  space  so  that  the 
N  x  1  matrix  A  is  a  function,  which  we  assume  to  be  con¬ 
tinuously  differentiable,  of  an  n  x  1  parametric  matrix  s. 

We  suppose  that  wo  are  given  over  this  locus  a  continuously 
differentiable  n  x  1  matrix  function  of  «  which  we  denote  by 
xt  (A) ,  since  we  intend  to  consider  the  extremal  curves  of  L 
which  pass  through  the  various  points  A  and  whose  velocity 
n  x  1  matrices  are  furnished  at  A  by  the  matrix  function  in 
question.  This  matrix  function  is  assumed  to  be  such  that  the 
points  z  =  j*(A))  of  our  (2N  -  1)  -  dimensional  state- space 

are  all  covered  by  the  region  D  of  this  space  over  which  we 
suppose  that  the  n  x  <2N  -  1)  matrix  Lx*  %  and  the  1  x  N 

matrix  Ljj  are  continuously  differentiable.  As  a  result 


through  ci  :h  point  A  of  our  n  -  dimensional  locus  an 
unambiguously  determinate  extremal  curve  of  L ,  whose 
velocity  matrix  at  A  is  furnished  by  the  given  n  x  1  matrix 
xt  (A)  .  We  suppose  further  that  the  n  -  parameter  family  of 
extremal  curves  of  L  which  we  obtain  in  this  way  simply  covers 
a  region  R  of  our  N  -  dimensional  time-coordinate  space, 
where  R  is  covered  by  the  projection  of  D  on  this  space.  In 
other  words,  we  suppose  that  there  passes  through  each  point 
X  of  R  one,  and  only  one,  member  of  our  n  -  parameter  family. 

The  equations  of  our  n  -  parameter  family  of  extremals  are 
of  the  form  X  =  X  (  T,  a),  where  the  Nxl  matrix  X  is  at 
each  point  of  R  a  continuously  differentiable  function  of  the 
Nxl  matrix  S  =  J^j  .  The  statement  that  our  n  -  parameter 
family  of  extremal  curves  of  L  simply  covers  R  is  equivalent  to 
the  statement  that  the  N  -  dimensional  Jacobian  matrix  Xg  is 
nonsingular  over  R.  Thus  S  is  a  continuously  differentiable 
function  of  X  over  R  and  so  xt,  being  a  continuously  differen¬ 
tiable  function  of  S ,  is  a  continuously  differentiable  function  of 
X  over  R .  We  denote  this  n  x  1  matrix  function  of  X  by 
w  (X)  .  Our  1  x  N  parametric  momentum  matrix  P  is  then 
a  continuously  differentiable  function  of  X  over  R .  When 
n  *  1,  so  that  our  problem  of  Type  1  of  the  calculus  of  variations 
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is  a  plane  problem,  we  term  such  a  1  -  parameter  family  of 
extremal  curves  an  extremal  field  over  R ,  and  we  term  the 
point  function  w  *  w  (t,  x),  the  field  function.  When  n  >  1 , 
however,  it  is  necessary  for  our  n  -  parameter  family  of 
extremal  curves  to  oossess  a  property,  which  is  not  required 
when  n  =  1 ,  before  we  term  it  an  extremal  field  over  R  • 

When  our  n  -  parameter  family  does  possess  this  property, 
we  refer  to  the  n  x  1  matrix  w  (X)  as  the  field  matrix. 

We  shall  discuss  in  our  next  lecture  the  property  which  our 
n  -  parameter  family  of  extremals  must  possess,  when  ni>  1, 
before  it  can  qualify  as  an  extremal  field  over  R,  and  we  shall 
begin  to  see  the  fundamental  significance  of  extremal  fields  in 
the  determination  of  sufficient  conditions  for  an  extremal  curve 
of  L  to  qualify  as  a  minimal  curve  cf  L . 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  8 


Extremal  Fields;  The  Hilbert  Invariant  Integral 


rb  rfi 

We  have  seen  that,  if  I  =  /  L  dt  =  /  F  d"T  is  the 


integral  of  a  problem  of  Type  1  of  the  calculus  of  variations 
evaluated  along  a  smooth  extremal  curve  C  of  L  between 
two  variable  end-points  A  and  B  ,  then 
«I  =  P(fi)5B-P(fl)5A 


We  now  suppose  that  C  is  a  member  of  an  n-parameter  family 
of  extremal  curves  of  L  of  the  type  described  in  our  last  lecture 
(so  that  each  member  of  the  f;  mily  intersects  a  given 
n  -  dimensional  point  locus  which  we  regard  as  the  locus  of  our 
initial  points  A  ,  and  the  family  covers  simply  a  given  region 
R  of  our  N  -  dimensional  time-coordinate  space)  A  is,  by 
hypothesis,  a  continuously  differentiable  function  of  an  n  x  1 
parameter  matrix  s,  and  if  X  is  any  point  of  that  member  of 
our  n  -  parameter  family  of  extremal  curves  of  L  whose  initial 
point  is  A,  X  =  X  (  T,  s)  is  a  continuously  differentiable  function 
ofthe  Nxl  matrix  S  *  ).  Setting  T  equal  to  an  arbitrarily 

chosen  continuously  differentiable  function  6  of  s  ,  we  may 
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take  B  =  X  (6,  s)  as  the  second  end-point  of  an  extremal  arc  of  L 
whose  first  end-point  is  A  and  avail  ourselves  of  the  relation 
6  I  =  P  (A)  6  B  -  P  (a)  5  A  .  Letting  B  vary  along  an  arbitrary 

piecewise-smooth  curve  C*  connecting  any  two  points  Bj 

and  B  of  R,  A  will  vary  along  an  unambiguously  determinate 

2 

piecewise-smooth  curve  C*  ,  in  our  given  n  dimensional  point 
locus,  connecting  two  points  Aj  and  Ag  ,  of  this  locus  which 
are  unambiguously  determined  by  the  two  arbitrarily 
given  points  Bj  and  Bg,  respectively,  of  R  since  the  N-dimensional 
Jacobian  matrix  Xg  is,  by  hypothesis,  nonsingular  over 
r.  Denoting  by  ^  and  I2  the  values  of  I  evaluated  along 
the  extremal  curves  Cj  and  C2  of  L  whose  end-points  are 
(Aj,  Bj)  and  (A2,  B2) ,  respectively,  we  obtain,  on  integrating 
the  relation  6  I  -  P  (B)  6  B  -  P  (a)  6  A  with  respect  to  the 
independent  variable  which  identifies  the  points  of  the  a  rbifrarily 

chosen  piecewise-smooth  curve  Cg,  the  relation 

h  '  h  "  I  *  p  (6)  6  B  /  .  p  (a)  6  A 

CB  CA 

This  equation  has  the  notable  feature  that  its  left-hand  side  is 

unambiguously  determined  by  the  two  arbitrarily  given  points 

B  and  B0  of  R  and  is,  accordingly,  insensitive  to  a  change 
1  2 
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I 


of  the  particular  piecewise* smooth  curve  Cg  that  connects  these 


points.  Thus  the  difference 


f  P'B)6B  ■  j  P  (a)  6  A 
C*  ” 


B  A 

depends  only  on  the  two  points  Bj  and  Bg  and  not  at  all  on 
the  particular  piecewise- smooth  curve  Cg  that  connects  these  two 
points.  When  n  =  l,  our  locus  of  initial  points  A  in  which  the 
curve  must  lie  is  1  -  dimensional,  i.e.  is  a  curve. 

Therefore  is  unambiguously  determined  by  the  two  points  Bj  and 


B  p  and  is,  like  the  difference 
£» 


L 


P  (B)  6  B 


B 


I  P  (a)  6  A  , 

la 


independent  of  the  piecewise-smooth  curve  Cg  which  we  choose 
to  connect  these  two  points  In  this  particular  case,  then,  noi  only 
is  this  difference  independent  of  Cg  but  so  also  is  each  of  its 
two  terms.  Thus,  when  n  -  1 ,  so  that  o>.r  problem  of  Type  1 
of  the  calculus  of  variations  is  a  plane  one,  we  have  the  following 
important  result: 

The  integral  J  P  (fi)  6  B  is  independent  of  the  particular 
CB 

piecewise-smooth  curve  Cg  that  we  select  to  connect  the  two 
arbitrarily  given  points  B^  and  Bg  of  R 

When  n  >  1,  this  result  is  not,  in  gpnera’  trie  since  the 
curve  ct  in  our  n-dimensional  locus  of  initial  points  is  not 
unambiguously  determined  by  its  end  points  Aj  and  A2  - 
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If,  however,  we  si  choose  the  n  x  1  velocity  ma'rix  x,  along 
oor  n  -  dimensional  locus  of  initial  pom's  that  the  c-  r-espandlng 
1  X  N  parametric  momentum  matrix  P  (a)  m  such  'ha'  the  integral 

P  (a)  6  A  is  unambiguously  determined  by  A,  and  Aj, 

it  being  the  same  for  ail  plecewise-smooth  curves  in  our  n- 
dimensionat  locus  of  initial  points  which  connect  A,  and  Aj , 

then  we  are  assu-cd  that  /  P  TO  4  B  i«  independent  -I  C* 

CB 

Writing  P  fa)  6  A  in  ihe  form  P  fa)  A,  ds  and  assuming  •!*' 

our  n  •  dimensional  locus  of  initial  points  ,s  simply  C'mect.d, 

we  see  that  a  sufficient  condition  for  J.  P  fa)  4  A  to  be 

CA 

independent  of  Is  that  P  <«)  As  be  the  gradient,  with 

respect  to  the  n  x  1  matrix  s  ,  of  a  continuously  differentiable 
function  of  s  .  In  particular,  if  this  function  is  a  numerical 
constant  so  that  P  ia>  A,  is  the  zero  1  x  n  matrix,  the 

integral  f  P  fat  6  A  is  independent  of  cj-  H  la,  in  fact, 

CA 

zero  for  all  plecewise-smooth  curve,  connecting  the  -wo  points 
A,  and  A2  In  this  case  we  say  that  our  n  -  perame'er  family 
of  extremal  curves  of  L  Intersects  transversal!)-  the 
„  .  dimensional  locus  of  initial  points,  for  example,  in  the  arc- 
length  problem,  every  member  of  our  n  -  parameter  family  of 
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* 


extremals,  each  a  segment  of  a  straight  line,  intersects 

at  right  angles  our  n  -  dimensional  locus  of  initial  points  when  the 

family  intersects  this  point  locus  transversally. 

We  term  our  n  -  parameter  family  of  extremal  curves  of  L 
an  extremal  field  over  R  when,  and  only  when,  the  integral 
I  P  (6)  6  B  which  we  shall  denote  by  I  ,  is  independent  of 
CB 

the  particular  piecewise-smooth  curve  Cfi  which  we  choose 
to  connect  the  two  arbitrarily  given  points  Bj  and  B2  of  R. 

At  any  point  X  of  R ,  the  velocity  nxl  matrix  xt  is  a 
continuously  differentiable  point  function  which  we  denote  by 
w  (X)  and  term  the  field  matrix  of  the  extremal  field.  When 
n  =  1,  any  1  -  parameter  family  of  extremal  curves  of  L  is  an 
extremal  field  over  R  if  it  possesses  the  following  two  properties: 

1)  Each  member  of  the  family  intersects  a  given  piecewise- 

smooth  curve  in  R. 

2)  R  is  simply  covered  by  the  family;  in  other  words,  there 
passes  through  each  point  of  R  one,  and  only  one,  member  of  the 
family.  This  will  be  the  case  if  the  2  -  dimensional  matrix  Xg 
is  nonsingular  over  R. 

When  n  >  1 ,  our  n  -  parameter  family  of  extremal  curves  of  L 
must  possess,  in  addition  to  these  two  properties,  the  following 
third  property  before  it  can  qualify  as  an  extremal  field  over  R : 
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3)  j  P  (a)  6  A  must  be  independent  of  :  in  other 

c A 

words,  P  (a)  A  ds  must  be  an  exact  differential  over  our 

s 

n  -  dimensional  locus  of  initial  points,  which  point  locus  we 

assume  to  be  simply  connected. 

When  our  n  -  parameter  family  of  extremal  curves  of  L 

is  a  field  over  R  we  term  the  integral 

1*  •  L  P  (6)  6  B 
CB 

the  Hilbert  invariant  integral  of  the  extremal  field.  In  this 
integral,  P  (fi)  »  P  (B,  w  (B)  )  where  w  (X)  is  the  n  x  1  field 
matrix  of  the  extremal  field.  Note:  P  =  P  (X,  Xf )  is  a  positively 


homogeneous  function,  of  degree  zero,  of  X^ ,  so  that 
.  X  v  X 

P  (X,  Xr)  =  p  (x,  where  j—  is  the  N  x  1  matrix 


and 


1  \  .  Hence,  P  (X,  Xr)  is  a  function  of  X  and  xt> 

xtl  7 

no  confusion  is  caused  by  writing  this  function  as  P  (X,  xt), 
since  xt  is  a  n  x  1  matrix  whereas  X  ^  is  a  N  x  1  matrix. 

If  the  two  points  Bj  and  B2  happen  to  Lie  on  a  member  of 
our  extremal  field  and  if  the  particular  curve  Cg  connecting 
these  two  points  which  we  select  is  an  arc  of  this  member,  then 
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P  (8)  =  P  (B,  w  (B)  )  =  P  (B,  Br  ),  and  I*  reduces  to 

r  fT* 

J  *  P(B,  Br)  Br  dT  =  ^  FdT. 

CB  1 

Thus: 

When  the  curve  C*  along  which  we  evaluate  Hilbert’s 

invariant  integral  I*  is  an  arc  of  a  member  of  our  extremal 
field,  I*  is  equal  to  the  integral 

rr2  „ 

I  -  I  Fd<  evaluated  along  this  arc. 

r, 

We  have  seen  that  our  n  -  parameter  family  of  extremal 
curves  of  L  will  be  an  extremal  field  of  L  over  R  if,  in 
addition  to  intersecting  a  given  n  -  dimensional  point  locus  and 
simply  covering  R,  it  intersects  this  n  -  dimensional  point 
locus  transversally.  By  writing  the  equation  of  our  n-dimensional 
point  locus  in  the  form  J(X)  =  0,  we  see  that  our  n-parameter  family 
of  extremal  curves  of  L  will  intersect  this  n-dimensional  point 

locus  transversally  if,  and  only  if,  Ap  =  Jx  over  il»  since 
5  X  =  0  is  the  only  constraint  on  6  X  over  the  point  locus. 

Here  A  is  an  undetermined  multiplier ,  and  we  ask  what  condition 
is  imposed  upon  our  n-dimensional  locus  of  initial  points  by 
the  requirement  that  A  have  the  constant  value  1  over  it. 
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Since,  under  this  requirement,  Jt  =  Pj ,  Jx  =  P  »  we  see  that 
J  +  H  (X,  J  )  =  0  over  the  n  -  dimensional  locus  of  initial  points. 

This  first  order  partial  differential  equation ,  which  we  write  as 
0<X,  Jx)  =  0,  where  0<X,  P)  =  Pj  +  H  (X,  p) ,  is  known  as 

the  Hamilton-Jacobi  partial  differential  equation.  Since  it  is 
insensitive  to  the  addition  of  a  constant  to  the  point  function 
j  -  j  (X) ,  we  see  that  if  we  set  P  =  Jx  at  each  point  of  any 

member  of  the  1  -  parameter  family  of  n  -  dimensional  point 
loci  J  (X)  =  C  ,  where  C  is  any  constant,  then  the  corresponding 
n  -  parameter  family  of  extremal  curves  of  L  will  cut  trans- 
versally  the  n  -  dimensional  point  locus  whose  equation  is 
J  (X)  =  C  .  On  differentiating  the  relation  0(X,  Jx)  =  0  with 

respect  to  X  ,  we  obtain  0X  ♦  0p*  Vx  a  0  •  Where  J  =  J  00 
is  a  solution,  which  we  assume  to  possess  continuous  second 
derivatives  with  respect  to  X ,  of  the  Hamilton-Jacobi  partial 
differential  equation  0(X,  Jx)  +  H  (X,  Jx)  -  0. 

Since  X(  =  (j bp,  by  virtue  of  the  defining  relation  P  * 
we  have 

0X*  :  ■  Jx* x Xt '  *  (Jx*)t 

and  it  follows  from  the  Euler -Lagrange  equation  Pt  -  1^  =  -<t>x 
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that  P  Jjj*  is  constant  along  any  extremal  curve  of  L.  If, 

then,  we  set  P  =  J„  over  our  n  -  dimensional  locus  of  initial 
points ,  this  relation  remains  valid  over  R  and,  hence,  over  that 
part  of  the  n  -  dimensional  locus  J  (X)  =  C  which  is  covered 
by  R.  Thus: 

If  J  -  J  (X)  is  a  solution  possessing  over  R  continuous 
second  derivatives  with  respect  to  the  N  x  1  matrix  X  and 
such  that  J  (X)  -  o  is  the  equation  of  an  n  -  dimensional  point 
locus,  then  the  n  -  parameter  family  of  extremal  curves  of  L 
determined  by  setting  P  1  Jx  over  this  n  dimensional  point 

locus  cuts  transversally  each  member  of  the  1  -  parameter 
family  of  point  loci  determined  by  the  equation  J  (X)  -  C ,  where 
C  is  any  constant;  the  relation  P  :  Jx  being  valid  over  each 


member  of  the  family. 

Example : 

For  the  arc- length  problem  in  N  dimensional  space, 


K  xr 


1/2 


problem,  L 

P 


1 


1  ♦ 


1  *  ' 
l.x,  xt| 

1/2  xfx( 

L 


-|f  -  so  that  P  P  =  1  .  For  this 
1/2 


x* 

t 

ps  TT‘  • 

l  . 

L 
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Now  L2  =  1  +  L2  p  p*,  so  that  =  <1  -  p  p*)  and  so 

H  (X,  p)  =  -  (1  -  p  p*W2  .  The  relation 
<£>  (X,  p)  s  Pj  ♦  H  (X,  p)  =  0  is  equivalent  to  the  relation 
p  *  (1  -  p  p*)1/2 ,  and  this  is,  in  turn,  equivalent  to  the  relation 
(Pj)2  =  1  -  pp*  ,  i.e.  P  P*  =  1 .  The  Hamilton-Jacobi  equation 
is  J  +  H  (X,  J  )  =  0,  and  this  may  be  written  in  the  more 
symmetrical  form  Jx  Jx*  =  1 .  Thus,  when  n  =  1 ,  the 
Hamilton-Jacobi  equation  is  |jt)  2  +  jjx]  2  =  1  ■  The  solutions 
of  this  equation  which  are  linear  functions  of  t  and  x  are 
J  (t,  x)  =-£  t  +  m  x ,  where  t2  ♦  m2  =  1 ;  the  1  -  parameter 
family  of  extremals  obtained  by  setting  P  =  Jx  over  the  curve 
■£  t  ♦  m  x  =  0  is  the  family  of  straight  lines  which  intersect  at 
right  angles  the  straight  line  whose  equation  is  ft  ♦  m  x  *  0. 
This  1  -  parameter  family  of  extremal  curves  of  L  also  inter¬ 
sects  at  right  angles  every  member  of  the  1  -  parameter  family 
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r 


I 


€  t  +  r  <  -  C  ,  where  C  is  any  constant.  If  we  introduce  plane 


polar  coordinates  (  r ,  0)  by  setting  t  =  r  cos  6  ,  x  *  r  sift  $  , 
the  equation  |jt  j2  +  |Jxj2  =  1  takes  the  form  J2  +  \  =  1  ’ 


and  the  solutions  of  this  equation  which  are  functions  of  r 


alone  are  of  the  form  r  =  constant.  The  1  -  parameter 
family  of  extremal  curves  obtained  by  setting  P  =  J  over  the 
circle  r  =  1  all  intersect  this  circle  orthogonally,  and  they 
also  intersect  orthogonally  each  member  of  the  family  of 
concentric  circles  r  =  C  ,  where  C  is  any  positive  constant. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  9 

The  Weierstrass  E-Function;  Positively  Regular  Problems 
The  importance-^!  the  Hilbert  invariant  integral 

r 

I*  =  P  (X,  w  (X)  )  dX 

J 

in  the  problem  of  determining  sufficient  conditions  for  an  extremal 
curve  of  the  Lagrangian  function  L  of  a  problem  of  Type  1 
of  the  calculus  of  variations  to  be  a  minimal  curve  of  L  stems 

from  the  following  two  properties  of  I*  : 

1)  I*  is  invariant;  i.e.,  if  C  and  C  are  any  two  piecewise- 

smooth  curves  connecting  any  two  points  A  and  B  of  the 
region  R,  of  our  N  -  dimensional  time -coordinate  space,  then 

I*  (C)  =  I*  (C) 

2)  I*  reduces  to  I  when  the  curve  of  integration  is  a  member 
of  the  extremal  field  over  R  whose  n  x  1  matrix  field  function  is 

w  (X) . 

To  see  how  useful  these  two  properties  of  I  are,  let  C  be 
an  arc  between  two  points  A  and  B  of  R  of  a  member  of 
the  extremal  field  of  L  over  R  whose  matrix  field  function  is 


75 


« 


w  (X)  ,  and  let  C  be  any  piecewise-smooth  curve  in  R  whose 
end>polnts  are  A  and  B .  Then,  in  order  that  C  may  be  a 
minimal  curve  of  L,  1(C)  must  be  >  I  (C)  if  the  domain  D  of 
our  (2n  ♦  1  )  -  dimensional  state-space,  whose  projection  on  the 
time-coordinate  space  covers  R ,  is  suitably  restricted.  Thus 
it  is  necessary  to  appraise  I  (C)  -  I  (C),  and  this  appraisal  is 
difficult  because  the  curves  of  integration  C  and  C  are  different, 
we  avoid  this  difficulty  by  first  replacing  1(C)  by  1*(C),  which 
is  legitimate  because  of  Property  2,  and  then  replacing  I*(C)  by 
I*(C)  ,  which  is  legitimate  because  of  Property  1.  Thus  the 
difference  1(C)  -  1(C)  is  the  same  as  the  difference  1(C)  -  1*(C) , 
and  this  latter  difference  is  relatively  simple  to  appraise,  since 
the  curve  of  integration  is  the  same  in  each  of  the  two  integrals. 

£  is  furnished  by  an  equation  of  the  form 

X  =  X  (T)  ;  a  ^  T  <  B 

and  I*  (C)  =  |  P  (X ,  w  (X)  )  Xr  J  d  T  .  On  denoting  by 

W  (X)  the  N  x  1  matrix  whose  first  element  is  t  and  whose 
remaining  n  elements  are  those  of  t^. w  (X),we  have 
P  (X,  w  (X)  )  =  P  (X,  W  (X)  ) ,  where  P  (X  ,  W  (X)  is  a  positively 
homogeneous  function  of  degree  zero,  of  the  N  x  1  matrix  W  (X) 
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Thus  I*(C)=  J6|p(X,  W(X))Xr|dT  and,  since 

r  8  —  — 

I  (C)  =  /  F  (X,  X  )  d  r  ,  we  have 
•'a 

UC)  -1(C).  J6|f(x,  Xr)-  [p  (X,  W  (X) )  *T]J dT 

Since  P  (X,  W  (X) )  W  (X)  =  F  (X,  W  (X)  ),  we  nay  write  this 
equation  in  the  form 

1(C)  -1(0=  J  *(X,  Xr,  W(X))d  T 

where 

E  (X,  XT ,  W  (X)  )  =  F  (X,  XT>  -  F  (X,  W  (X) )  -  P  (X,  W  (X))^-  W  (X) ) 

M  X  =  X  ( T ) ,  a  <  T  <  fi  ,  is  any  piecewise-smooth  curve  in  R, 
the  function 

E  (X,  Xf,  W(X)  )  =  F  (X,  Xf)  -  P  (X,  W(X)  )  Xr 

=  F  (X,  Xr)  -  F  (X,  W(X)  )  -  P  (X,  W(X)  )  (Xr-  W(X)  ) 

is  piecewise  continuous  along  the  curve.and  I  (C)  -  I  (C)  ie  the 
integral  of  this  function  along  C  .  E  (X,  X^  W(X)  )  is  a  positively 
homogeneous  function,  of  degree  1,  of  the  two  N  x  1  matrices 
X  and  W  (X)  ,  i.e. ,  if  k  is  any  positive  number 

T 

E  (X,  k  X  ,  k  W(X)  )  -=  k  E  (X,  XT,  W(X>). 
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This  function  is  known  as  the  Weierstrass  E  -  function,  and 
our  task  is  to  appraise  the  integral  of  this  Weierstrass 
E  -  function  along  C.  If  E  is  nonnegative  along  C,  it  is 
certain  that  1(C)  >  1(C);  furthermore,  this  weak  inequality 
may  be  replaced  by  the  strong  inequality  1(C)  >  1(C)  if  E, 
in  addition  to  being  nonnegative  along  C,  is  positive  at  a 
single  one  of  its  points  of  continuity  on  C.  Hence,  C  is  a 
minimal  curve  of  L  if  E  is  nonnegative  over  R,  and,  furthermore, 
if  E  is  positive  at  all  except,  possibly,  a  finite  number  of  points 
of  any  piecewise-smooth  curve  C  in  R  connecting  the 
end-points  A  and  B  of  C  ,  I  (C)  <  I  (C),  so  that  C  furnishes 
an  absolute  minimum  of  our  integral  I .  The  importance  of  this 
result  is  that  it  furnishes  a  uniqueness  theorem  for  minimal 
curves  of  L  through  the  points  A  and  B  of  R  .  Indeed,  if 
C  and  C  were  two  different  minimal  curves  of  L  connecting 
the  points  A  and  B  of  R ,  we  would  have  the  two  contradictory 
inequalities  I  (C)  <  I  (C) ,  I  <CK  I  (C) .  This  uniqueness 
theorem  is  on  a  plane  of  mathematical  difficulty  quite  different 
from  the  more  elementary  uniqueness  theorem  of  ordinary 
differential  equations, which  states  that,  if  L  satisfies  over  D 
certain  not  very  restrictive  conditions,  then  there  passes  through 
each  point  z=|*J  of  D  an  unambiguously  determinate  curve  T 
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which  is  the  image  of  an  extremal  curve  of  L  We  may  term 
this  a  local  uniqueness  theorem, since  the  data  which  are  necessary 
to  determine  the  unique  extremal  curve  of  L  are  the  coordinates 
of  a  single  point  z  of  D  On  the  other  hand,  the  uniqueness 
theorem  which  we  are  now  encountering  and  which  depends  on 
the  positiveness  of  the  Weierstrass  E  -  function,  may  be  termed 
a  global  uniqueness  theorem, since  *he  data  which  are  necessary  to 
determine  the  unique  minimal  curve  of  L  arp  the  coordinates  of 
two  arbitrarily  chosen  points  a  and  B  of  R  In  order  to 
avail  ourselves  of  the  proof  just  given  of  the  validity  over  R 
of  this  global  existence  theorem, we  must  be  certain  that  our 
Lagrangian  function  L  possesses  the  following  two  properties: 

1)  Any  extremal  curve  of  L  that  connects  any  two  points 

A  and  B  of  R  may  be  imbedded  in  an  extremal  field  of  L  over 
R  . 

2)  The  Weierstrass  E  functions  of  the  various  extremal 
fields  of  L  referred  to  in  l)  ,  are  all  essentially  positive.  By 
this  we  mean  that  these  E  -  functions  are  nonnegative  and  such 

that  the  integral  of  E  over  any  curve  connecting  the  points  A  and 
B,  other  than  the  extremal  curve  which  is  imbedded  in  the  field, 
is  positive.  For  example,  E  is  essentially  posi‘ive  if  it  is  non- 
negative  and  is  zero  at  not  more  than  a  finite  number  of  points  of 
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any  piecewise-smooth  curve  connect ing  A  and  B  other  than 
the  extremal  curve  of  L  referred  to. 

The  arc-length  problem  in  N  -  dimensional  Euclidean  space 

is  one  for  which  it  is  easy  to  prove  the  global  existence  theorem 

without  the  necessity  of  Integrating  the  Euler- Lagrange  equation, 

since  the  Weierstrass  E  -  function  takes  a  particularly  simple 

form  in  this  problem.  We  have  already  seen  that  any  region  of  our 

N  -  dimensional  time-coordinate  space  may  be  covered  by  an 

extremal  field  of  F  =  (X^X^,)1  2  consisting  of  parallel  open 

line-segmerts.  If  A  and  B  are  any  two  points  of  this 

N  -  dimensional  Euclidean  space.the  line-segment  A — »B  is  an 

1  /2 

extremal  curve  of  the  Lagrangian  function  L  =  (1  ♦  xt  xt) 

or,  equivalently,  of  the  parametric  integrand  F  =  (X*,  X^.  )  2, 

and  we  take  as  our  extremal  field  of  L  the  field  whose  line- 
segments  are  parallel  to  the  line-segment  A — »B.  Then  P  (X,  W(X)  ) 
is  the  unit  N  x  1  matrix  which  furnishes  the  direction  of  the  line- 
segment  A — *B  so  that  P(X,  W(X))X  -  F  (X,  Xr  )  cos  6, 
where  6  is  the  angle  from  the  line-segment  A-  'B  to  the  tangent, 
in  the  sense  in  which  T  is  increasing,  to  the  curve  C,  whose 
equation  is  X  =  X  (T),  a<  T  <  6.  Hence 
E  (X,  X  ,  W(X)  )  =  FIX,  X  )  (1  -  cos £)  which  is  nonnegative, 

T  T 
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since  F  (X,  X  )  >  0  .  Furthermore,  F  (X,  X^  )  >  0  except 
possibly,  at  a  finite  number  of  points  of  C  ,  since  C  is,  by 
hypothesis,  piecewise  smooth,  and  so  E  is  positive  along  C 
except  for  this  finite  number  of  points  and  the  points  where  0  =  0. 

If,  then,  there  is  one  point  of  C  not  belonging  to  the  finite 
number  of  points  referred  to,  at  which  6  t  0  ,  the  integral  of 
E  along  C  is  positive  so  that  I  (C)  >  I  (C) .  U  0  =  0,  except 
possibly  at  a  finite  number  of  points  of  C  ,  C  is  a  line-segment . 
and  so  must  coincide  with  C .  since  it  passes  through  the  points 
A  and  B .  This  completes  the  proof  of  the  global  uniqueness 
theorem  for  the  arc -length  problem  in  N  -  dimensional 
Euclidean  space. 

It  follows  from  the  extended  theorem  of  the  mean  of  differential 
calculus  that 

E  (X,  X  ,  W(X)  )  =  F  (X,  XJ  -  F  (X,  W(X)  )  -  P  (X,  W(X)  )  0^-  W(X)  ) 
a[xr-  W(X)}*FX.V(X,  V,(XT-W,X). 

where  V  •  W  (X)  e  f  ta.  -  W(X)J  ,  0  <  6  <  1 .  Since  the  first 
elements  of  the  two  N  x  1  matrices  Xr  and  W(X)  are  the  same, 
namely  T  and  since  the  n  -  dimensional  matrix  obtained  by 

T  ’ 

erasing  the  first  row  and  the  first  column  of  the  N  -  dimensional 

matrix  F-*  x  is  the  quotient  of  Lx*  x  by  t  , 

t  t 
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I* 

* 

F  being  L  t  ,  E  (X,  3^,  W(X)  )  may  be  written  in  the  form 

*  L  *  (X,  v)  x  -  w(X)  times  t  ,  where  w(X) 

xt  xt  '  1  » 

is  the  n  x  1  matrix  field  functioned  v  =  w(X)  *  w(X)j  • 

Thus 

I  (C)  -  I  (C)  =  \  J  J  xt  -  w(X)  j  Lx*^  (X,  v)Jxt  -  w(X)|  dt 

We  say  that  our  problem  of  Type  1  of  the  calculus  of  variations 

is  positively  regular  over  our  (2n  ♦  1)  -  dimensional  domain  D, 

if  the  n  -  dimensional  matrix  L  *  is  positively  definite 

xt  t 

over  D.  We  assume  that  D  has  the  type  of  convexity  which 

X  1  X 

is  implied  by  the  statement  that  if  w  and  w  are  any  two 

1'  2 

points  of  D  which  have  the  same  projection  on  R  ,  then  the  line 
segment  connecting  these  two  points  is  covered  by  D  .  Since 

I*  -  andl-  are,  by  hypothesis,  points  of  D, so  also  is 
1  w(X)  /  l  *t 

|*  J  and.  so,  for  a  positively  regular  problem,  E  is  positive  at 

those  points  of  C  for  which  xt  -  w(X)  is  not  the  zero  n  x  1 
matrix.  Hence,  I  (C)  >  1  (C)  unless  x^  =  w(X)  along  C  , 

when  this  is  the  case,  C  is  a  member  of  our  extremal  field  over 
J  R  and,  since  it  passes  through  A,  it  must  coincide  with  C. 

Consequently,  we  see  that  C  furnishes  an  absolute  minimum  of  1(C)  for  all 
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piecewise-smooth  curves  C  connecting  the  points  A  and  B 
which  are  covered  by  R.  Thus  the  global  uniqueness  theorem  is 
valid  for  positively  regular  problems  of  Type  1  of  the  calculus  of 
variations  (provided  that  L  possesses  the  Property  1  concerning 
the  possibility  of  imbedding  any  extremal  curve  of  L  , 
covered  by  R,  in  an  extremal  field  of  L  over  R  )  . 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 


Lecture  10 

A  Simple  Example  of  the  Construction  of  an  Extremal  Field; 
Rayleigh  Quotients  and  the  Method  of  Rayleigh-Ritz 

We  r.ow  consider,  in  order  to  have  a  simple  example  of  the 
construction  of  an  extremal  field,  the  plane  problem  of  Type  1 
of  the  calculus  of  variations  whose  Lagrartgian  function  is 

L=lje*V2dVe*2*2|’‘j 

where  the  coefficients  c,  d,  e,  f,  are  either  constants  or, 
possibly,  functions  of  t  .  The  Lagrangian  momentum  is 

p  =  c  xt  ♦  d 

and  the  matrix  L  #  ,  here  1  -  dimensional,  is  c.  The  Euler- 

x  t  xt 

Lagrange  equation  is 

c xtt +  ct  W e  x *{ 

and  we  assume,  in  order  to  be  assured  of  the  uniqueness  as  well 
as  the  existence  of  a  solution  of  this  equation  for  which  the  initial 
values  of  x  and  x(  are  arbitrarily  prescribed,  that  c  and  d 
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possess  continuous  second  derivativeSjand  that  e  and  f  possess 
continuous  first  derivatives  over  a  given  open  interval 
t  <  t  <  tj .  The  problem  is  positively  regular  over  any  region  D 

of  our  3  -  dimensional  state-space  whose  projection  on  the  (t,  x)- 
plane  is  covered  by  the  strip  tg  \  t  <  t^  of  this  plane,  which 

strip  is  parallel  to  the  x  -  axis  if  c  is  positive  over  the  interval 
t  C  t  <"  tj  ,  and,  when  this  is  the  case,  the  strong  Legendre  con¬ 
dition  is  satisfied  at  every  point  of  every  extremal  curve 
covered  by  the  strip  tg  <  t  <  t^  .  Let  x  =  x(t),  a^  where 

tQ  <  a  <  b  <  tt ,  be  an  arc  C  of  any  extremal  curve 
coveredby  the  strip  tg  <  t  <  tj  ,  and  let  x  =  x  (t),  a  ^  t^  b 
be  any  piecewise-smooth  curve  C  ,  which  is  covered  by  the  strip 
t  <  t  <  tj  and  which  has  the  same  end-points  as  C  .  Writing 

x  (t)  =  x  (t)  +  y  (t ) ,  so  that  both  y  (a)  and  y  (b)  are  zero,  we  have 


and  it  follows,  since 

1(C)  =  3  Jb  [C  x2t+  2dxt  +  ex2+  2fxj  dt 
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that 


I  (C)  -  I  (C) 


rbi 


l 


(cx(  *  d)  y(  ♦  ^  cy?'t  ♦  (ex  +  f)  y  *  ^  eV  j  dt 


Upon  integration  by  parts  wo  obtain,  since  both  y  (b)  and 

y  (a)  are  zero, 
r  b  r  b 

(c  x  +  d)  y  dt  -  I  (c  x  •  d)  y  dt  and,  since  C  is  an 
J  t  t  *  ' 


extremal  curve,  this  may  be  written  as  -  I  (ex  *  0  y  dt  .  Thus 

Ja 


1  r D  2  2 

I  (C)  -  I  (C)  -  g  I  (c  y  t  ♦  e  y  )  dt 


and,  if  e  >  0  over  the  interval  ai,tv  b,  the  right-hand  side 

«  r  b  2 

of  this  equation  is  c  y  t  dt  which  is,  in  turn,  >0. 

The  weak  inequality  may  be  replaced  by  the  strong  inequality 

>  unless  y  =  0  at  all  its  points  of  continuity.  Since  y  (a)  0, 

this  would  imply  that  y  -  0  over  the  interval  a  <  t<  b  Thus 
the  inequality  e  >0  over  tQ  '  t  •  tj  IS  sufficient  to  ensure  that 

1(C)  >  1(C),  where  C  is  any  arc  of  any  extremal  curve  which  is 

covered  by  the  strip  tQ  ■  t  ■  ^  , and  C  is  any  piecewise-smooth 

curve  which  is  covered  by  this  strip  and  has  the  same  end-points  as 
C  (it  being  understood  that  C  is  furnished  by  an  equation  of  the 
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form  x  =  x  (t) ,  a  <  t  <  b .  so  that  no  two  points  of  C  have  the  same 
projection  on  the  t  -  axis).  The  strong  inequality  I  (C)  >  I  (C) 
assures  us  that  there  does  not  exist  more  than  one  extremal  curve 
connecting  any  two  given  points  A  :  fa  ,  x  (a)  )  and  B  :  fb  ,  x  (b)  ) 
of  the  strip  tQ  <  t  ■  ;  indeed,  if  two  such  extremal  curves  Cj 

and  Cj  existed  ,  we  would  have  on  taW'nq  Cj  as  C  and 

C2  as  C  the  inequality  I'C2>  •  KCj)  ,  and  we  would  also  have 

on  taking  C2  as  C  and  as  C  the  contradictc  /  inequality 

I  (C  )  _■>  I  (C  ) .  Thus  any  arc  of  any  extremal  curve  whfch  is 
1  2 

covered  by  the  strip  tQ  *' '  t  <  is  a  minimal  curve  ,  and  no 

other  curve  connecting  the  end-points  of  this  arc  is  a  minimal  curve. 
Moreover,  this  arc  of  the  extremal  curve  is  not  only  a  minimal 

f  b 

curve  but  it  yields  an  absolute  minimum  of  the  integral  I  r  J  L  dt  ■ 

d 

The  condition  e  >  0  ,  tQ  t  •  tj  .  while  sufficient  to  yield 

the  strong  inequality  I  (C)  >  I  (C) ,  where  C  is  any  piecewise-smooth 
curve  of  the  form  x  =  x  (t) ,  aMO.  with  the  same  end-points 
as  C  ,  is  not  necessary  for  the  validity  of  this  inequality.  For 
example,  let  us  consider  the  case  where  the  coefficients,  c,  d,  e, 
and  f  are  all  constant,  c  being  1,  d  and  f  being  zero,  and  e 
being  -k2  where  k  >  0  .  Our  Euler- Lagrange  equation  is 
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x  +  k  x  =  0,  so  that  our  extremals  are  of  the  form  x  s  cos  (kt  -  6), 
where  s  and  6  are  constants  of  integration  .  We  take  as  our  given 

M  \  /  M 

end-points  the  points  o  ’  0  and  as  °Ur  reKlon  D  of  our 

3  -  dimensional  state-space  the  slice  -  1  -  €  t  1  *  €  , 
perpendic’-’ar  to  the  t  -  axis,  where  €  is  a  positive  number  which 
we  ,na  ,  .ake  to  be  as  small  as  we  please.  On  setting  6  0  we 

,-bta.i  ’-he  1  -  parameter  family  of  extremal  curves  x  s  cos  kt  , 

all  of  which  intersect  the  x  -  axis.  In  order  that  this  1  -  parameter 
family  of  extremal  curves  be  an  extremal  field  over  the  strip 
_  l  -  f  •.  t  1  -  £  ,  all  that  is  necessary  (since  n  1)  is  that  it 
simply  cover  this  strip, and  the  condition  for  this  is  that  the 
Jacobian  matrix  of  ^  with  respect  to  ^  J  be  nonsingular  over 

the  strip.  The  determinant  of  this  Jacobian  matrix  is  Xg  cos  kt 

and  so  the  1  -  parameter  family  of  extremal  curves  x  s  cos  kt 

will  be  an  extremal  field  over  the  strip  -  1  -  €  i  1  ♦  f:  if, 

and  only  if,  k(l  -  ()  •  ^  or’  t>quivaIen,lV-  sin,  e  C  is  arbitrarily 

small  if,  and  only  if.  k  \  .  (If  k  \  all  the  extremal  curves 

of  the  family  x  s  cos  kt  have  the  common  points 

that  the  family  does  not  simply  cover  the  strip  -  1  -  ( 
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no  matter  how  small  the  positive  number  £  ).  When  k  <  ^  , 

so  that  the  1  -  parameter  family  of  extremals  x  =  s  cos  kt  is 

an  extremal  field  over  the  strip  -l-f<t<l+£  iffis 

sufficiently  small,  the  field  function  w  (t,  x)  is  -  k  x  tan  k  t , 

12  2  2 

since  x^  =  k  s  sin  k  t  =  -  k  x  tan  k  t  .  Since  L  =  ^  (x  ^  k  x)> 

we  have  Pj  =  L  -  px^  -  -  ^  (x  t  +  k  x  ) »  f*2  =  ^  =  xt  SO 

P  (X,  W  (X)  )  X  -  -  \  k2  X2  (tan2  kt  *  1)  -  k  x  tan  k  t  xt 

(our  parameter  T  being  the  Lagrangian  parameter  t  ). 

Hence,  the  Weierstrass  E  -function  is  furnished  by  the  formula 

E  (X,  xt,  w  (X)  )  --  L  -  P  (X,  W  (X)  )  Xt  =  \  (x2t  *  k2x2  tan2  kt)  +  lJC  tan  kt  xt 

1  2 
=  j  (x(  +  kx  tan  kt  x() 


b_  _  f  -  _  -2 

I  (C)  -  I  (C)  =  E  (X,  xt,  w  (X)  )  dt  =  5  J  (xt  +  k  x  tan  kt  xt)  dt  >  0  . 

a  a 

The  weak  inequality  y  may  be  replaced  by  the  strong  inequality  > 
when  C  does  not  coincide  with  C  .  for  the  equality  would  imply  that 
x’t  *  k  x  tan  kt  xt  =  0  at  all  the  smooth  points  of  C  ,  and  this  would, 

in  turn,  imply  that  x  =  s  cos  kt ,  where  s  is  a  constant,  at  all  the 
smooth  points  of  C  .  Since  cos  kt  i  0  over  the  strip  -  1  -  €<  t  <  1  ♦  € , 
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,f  f  „  sufficiently  small,  and  sinc.  x  0  at  the  . nd-points  of  our 


extremal,  the  constant  of  integration  s 


must  be  zero  ,  so  that 


—  .  .  ...  n  Xh„  mfmhpr  C  of  our  extremal  field 

C  must  coincide  with  C  The  mcniner  u 

x  =  scoskt,  -  1  -t'  t  l‘£.  which  passes  through  the  two 

points  (-J)  and  (  J)  ,  is  furnished  bv  setting  s-0,  this  member 

being  the  line  segment  connecting  these  two  points;  thus  x  and  x(  , 

and  consequently  L  are  zero  along  C  .  so  that  1(C)  0  .  Hence, 
I  (C)  >  0,  it  being  always  understood  that  0  k  2  '  ThlS  stnkinR 

result  may  be  stated  as  follows: 

If  x  =  x  (t),  -  1  <-  t  O  ,  is  any  piecewise-smooth  curve 

.  ,  M  other  than  the  line  segment 

connecting  the  two  points  |  q  )  ’  0  j  ’ 

r  1  1 

which  connects  these  points,  the  integral 


J-\ 


x2(  dt  is  greater 


than  k2  times  the  integral 


(  x2  dt  ,  w 
J-\ 


here  k.  is  any  positive 


number  <  \  ;  or,  equivalently,  if  x  -  x  ft)  ,  -  1  t  1  ,  is  any 


1  1  II 


piecewise-smooth  curve  connecting  the  two  points  |  0  j  .  |0  |  > 
other  than  the  line  segment  which  connects  these  two  points,  the 

/lx2tdt 

2  J.\  1 

number  V  is  <  the  quotient  - - 

f  ;2d, 

-1 
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2 

Thus  —  is  a  lower  bound  for  the  collection  of  such  quotients 

4 

That  it  is  the  greatest  lower  bound  of  this  collection  becomes  clear 


on  taking  x  =  A  cos  "  t ,  -  Kts  1  ,  where  A  is  any 


constant,  since,  for  this  smooth  curve  which  connects  the  two 


points 


l-l) 

/ 1 

and 

W 

the  quotient  in  question  is 


may  regard,  if  we  wish,  this  property  of  the  number 


2 

17 

4 

2 

ir 

T 


We 


as  a 


definition  of  the  number  rr  ,and  we  may  use  this  definition  to 
appraise  the  number  u  .  Thus  if  we  set,  for  example, 


x 


14^4 


the  quotient 


2.  5  , 


2 

and  we  know  that  -^-4  2.5. 

2 

Actually  ^-=  2.4674  to  4  decimal  places,  so  that  the 
appraisal  is  astonishingly  good. 

1  2  2  2 

Our  Lagrangian  function  L  =  j  ^  x  )  ^ 

regarded  as  the  Lagrangian  function  of  a  mechanical  system, 
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with  one  degree  of  freedom,  for  which  the  kinetic  energy 

12  2 

T  ^  xt2  and  the  potential  energy  V  2  k  X  ' 

The  Euler- Lagrange  equation  of  this  problem  namely, 

x  ♦  k2  x  0,  has,  when  k  ^  0  the  general  solution 
tt 

x  s  cos  (k  t  -  r.) ,  and  for  this  curve  to  pass  through  the  two 

without  being  the  line  segment  connecting 

these  two  points,  we  must  have  cos  (K  ♦  6)  r  0  ,  cos  (k  -  f>)  -  0. 
These  two  equations  are  equivalent  to  the  two  equations 
cos  k  cos  ft  0  ,  sin  k  sin  6  0  .  From  the  first  of  these  two 

equations  we  set*  that  either  k  or  6  must  be  an  odd  integral 

multiple  of  |  ,  and  from  the  second  we  see  that,  if  *  is  an 

odd  integral  multiple  of  2  •  then  ^  IS  a  nonz,'r,)  integral 

multiple  of  it.  In  either  event ,  k  is  a  nonzero  integral 
multiple  of  ~2  .and  we  term  the  squares  of  these  nonzero 

integral  multiples  of  ij  th<>  characteristic  numbers  of  the 

boundary- value  problem  turmshed  t » v  the  difierenli.il 

equation  x  ♦  \x  0  and  the  boundary  conditions  x(-l)  0, 

2 

x(l)  =  0  .  The  least  of  these  characteristic  numbers  is  j  • 
and  we  have  seen  that  this  least  characteristic  number  is  the 
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greatest  lower  bound  of  the  collection  of  quotients 


J 

-i 

"7  1 


for  all  piecewise-smooth  curves  of  the  form  x  x(t), 

1 


which  connect  the  two  points 


-1 

0 


,  the  line  segment 


that  connects  these  two  points  being  excluded.  We  denote  any 
one  of  this  collection  of  quotients  by  the  symbol  R ,  since  the 
connection  between  these  quotients  and  the  least  characteristic 
number  of  certain  boundary-value  problems,  whose  charade 
numbers  are  all  positive,  was  first  stated  by  Rayleigh; 
and  we  term  R  the  Rayleigh  quotient  for  the  piecewise-smooth 
function  *.x(t).-loUl.  which  is  arbitrary  except 

that  it  must  satisfy  the  boundary  conditions  without  being 
identically  zero. 

Let  tts  now  consider  a  lamily  o(  piecewise-smooth  iunctions 

x  =  X  (t,  s) .  '  >  <  -  1  '  d<!Pendlng  on  a  Paranwter'  °r  I*r,,me,riC 

TOtrlx  s  ,  the  only  care  necessary  in  the  choice  ot  this  lamily 

being  that  each  oi  .is  members  mas.  satisfy  the  boundary  conditions 
Then  .he  Rayleigh  quotient  »■»(•)  <«•  merab"  °‘  'he 


family  which  is  not  identically  zero  will  be  a  function  of  the 

parameter,  or  parametric  matrix  s.  No  matter  what  the 

value  of  s  ,  R  (s)  will  be  >  the  least  characteristic  number 
2 

we  are  seeking  {  ^  in  the  simple  example  we  have  chosen  to 

illustrate  the  method)  By  selecting  s  so  as  to  minimize  R  (s), 
we  obtain  the  best  approximation  to  the  least  characteristic 
number  we  are  seeking  which  we  can  obtain  by  the  use  of  the 
family  we  have  adopted.  This  modification  of  Rayleigh’s 
procedure  was  proposed  by  Ritz.and  the  method  is  known  as  the 
Rayleigh-Ritz  method.  For  example,  in  the  problem  we  are 
considering,  we  may  set  x  (t)  (1  -  t  )  (1  ♦  s  t  ) ,  the  factor 

1  -  t2  automatically  taking  care  of  the  boundary  conditions  and 
the  other  factor  being  taken  to  lie  1  ♦  s  t  rather  than  1  ♦  t,  t  , 
in  view  of  the  symmetry  of  the  problem  about  t  0  .  A  simple 


calculation  yields  R  (s) 


2 

3  35  14  s  ♦  1 1  s 

2  \  2 1  +  6  s  4  s? 


and  this  has 


a  minimum  value  when  s  is  the  greater  of  the  two  roots  of 

the  quadratic  equation  26  s2  4  196  s  *  42  0  .  Taking  s  -  -  0.  22  , 

which  is  this  root  to  two  decimal  places,  we  find  R  (s)  =  2.  467438 

2 

to  6  decimal  places  and  this  is  greater  than  -  2.  467401  by 
less  than  4  units  in  the  fifth  decimal  place.  Observe  that  the 
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minimizing  value  of  the  pa -a meter  s  does  not  have  to  be 

determined  with  great  precision, since  the  Rayleigh  quotient 
R  (s) ,  the  quantity  in  which  we  are  interested,  is 
stationary  at  this  minimizing  value. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  11 


1 


The  Principle  of  Maupertuis;  The  Propagation  of  Waves 

We  have  seen  how  to  pass  from  the  Hamiltonian  function 
H  (X,  p)  of  a  problem  of  Type  1  of  the  calculus  of  variations 
to  the  Lagrangian  function  L  (X,  x^)  of  this  problem,  all  we 

have  to  do  is  to  solve  the  equation  Hp  (X,  p)  =  x(  for  p  as  a 

function  of  X  and  and  set  L  -  p  x^  -  H  (X,  p).  In  order  to 

present  this  passage  in  parametric  form, we  observe  that  the  role 
of  the  Hamiltonian  function  H  (X,  p)  is  to  furnish,  through 
the  equation 

(p  (Z)  -  Pj  ♦  H  (X,  p)  =  0 

the  (2N  -  1)  -  dimensional  point  locus,  in  the  extended  phase  space 


whose  points  are  Z  - 


on  which  lie  the  images  of  all 


piecewise-smooth  curves  in  our  N  •  dimensional  time- coordinate 
space.  The  particular  form  in  which  the  equation  of  this  (2N  -  1)  - 
dimensional  point  locus  is  written  is  of  no  significance, and  it  is 
sometimes  convenient,  for  reasons  of  symmetry,  to  change  this 
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form  (without  changing,  naturally,  the  point  locus  itself).  For 
example,  for  the  arc -length  problem  in  the  plane, 

xt 

L “ * x.2)‘/2 •  77^7*  ’  x,=  ^W75  ' 


H  (X,  p)  =  P  \ 


-  L  =  -d 


PV72 


so 


that  0(Z) 


P1  -  (1  - 


p2)172 


The  point- locus  in  the  4  -  dimensional  extended  phase  space  whose 


equation  is  0(Z)  i  P,  -  (1  -  Pj2>  1/2  =  0  may  be  more 

1  2  2 

symmetrically  described  by  the  equation  (Z)  *  5  (P,  *  p2  ‘  11  ‘  °' 

We  write  any  such  equation  ol  our  (2N  -  1)  -  dimensional  point  locus 
as  i/l(Z)  -  0  ,  where  l//(Z)  is  taken  to  be  a  continuously  differentiable 
function  of  Z  .  The  gradient  0Z  of  0  is  a  multiple,  In  general 
varying  with  Z  ,  of  the  gradient  0  z  of  0,  and  this  latter  gradient 

is  (Hx;  1,  Hp>.  The  relation  p  ■  ly  which  defines  the  Lagrangian 

momentum  matrix  p ,  is  equivalent  to  the  relation  x,  >  Hp ,  and  so 
the  last  N  of  the  elements  of  0Z  are  those  of  X(  =  X^,  T(  . 


Hence  the  last  N  ol  the  elements  of  0Z  are  proportional  to  those 


Of  ,  and  we  write 

T 


97 


xr  -  A.i/cp 

where  A  is  a  multiplier  which  depends  on  The  particular  equation 


Z)  =  0 


we  have  chosen  to  represent  our  (2N  -  1)  - 


dimensional  point  locus  in  our  2N  -  dimensional  extended  phase 
space,  A  being  t^.  when  We  term  the  function  (//"( Z) 

a  parametric  Hamiltc..ian  function,  and  in  order  to  pass  from  any 
parametric  Hamiltonian  function  I jf  to  the  corresponding 
parametric  integrand  F  =  F(X,  X^. ) ,  we  proceed  as  follows: 


1)  We  solve  the  N  ♦  1  equations 

Al/ep  -  Xr  ;  \fj{ Z)  =  0 

for  P  and  A  as  functions  of  X  and  X^,  .  In  order  to  be 
assured  of  the  possibility  of  carrying  out  this  step, we  assume  that 


the 


(N  +  1)  -  dimensional  matrix, 


^pp*  ftp  1 
'ftp-  0  | 


exists  and 


is  continuous  and  nonsingular  over  a  region  of  our  2N  -  dimensional 
extended  phase  space  which  covers  the  closed  point  set  in  the  space 
furnished  by  the  equation  (Z)  =  0.  The  determinant  of  the 


(H+  1) 


dimensional  Jacobian  matrix  of 


with  respect  to 


fp*] 

\xftpp-ftp\ 

Uj 

,  namely, 

l  ftp-  0 . 

is  the  product  of  the 
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I  ^pp*  \ 

determinant  of  the  (N  ♦  1)  -  dimensional  matrix  ^  y  q  j 

by  ‘  1  and  so  our  hypothesis  assures  us  of  the  possibility 
of  solving,  without  ambiguity,  the  N  ♦  1  equations  Al/z'p  =  X^.  , 

=  0  for  P  and  A  as  functions  of  X  and  X^.  at  all 

points  where  X^  is  not  the  zero  N  x  1  matrix  (so  that  A  i  0) 

The  function,  A  =  A(X,  *T  ) ,  which  we  obtain  in  this  way  is 

a  positively  homogeneous  function,  oi  degree  1,  of  X^. ,  whereas 

elements  of  the  lxN  matrix  P  are  Positively  homogeneous 
functions,  of  degree  zero,  of  Xf ,  since,  from  any  solution 

(P,  A)  of  our  N*  1  equations,  we  obtain  a  solution  (P,  kA) 
of  the  N  ♦  1  equations  A  =  k  X^.  ,  l/^(Z)  =  0  .  Note.  The 

adjective  ’’positively"  is  necessary, since  A  may  be  determined 
by  means  of  an  algebraic  equation  of  degree  higher  than  1 . 

For  example,  for  the  arc-length  problem  in  the  plane,  if  we  take 

\p{l)  to  bo  i  (p,2  ♦  P22  -  1 )  .  °“r  lre  A  P1  ■  V 

Ap2-xt  ,  P,2*P22=l,»nd  A 2  =  «r2*V  •  ^ 

A=  |t  2  +  x  2  |  1/2  ,  we  obtain  a  function  of  X^.  which  is 


the 
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positively  homogeneous  of  degree  1  but  not  homogeneous  of  degree  1  . 

2)  We  set  F  (X,  Xr  )  P  *r  ,  thus  obtaining  a  positively 

homogeneous  function  of  degree  1,  of  Xf  .  The  function  F(X.Xr>, 
which  we  obtain  in  this  way, is  the  parametric  integrand  of  a  calculus 
of  variations  problem  of  Type  1  of  which  i//(Z)  is  a  parametric 
Hamiltonian  function  Indeed  Fx  P  *  X*  P*x  p*  since  P 

r  '  r 

is  a  positively  homogeneous  function,  of  degree  zero,  of  ■ 

The  integral  1=  /  6  F  (X,  X  )  d  T  may  then  be  presented  as 

the  line  integral  J  P  dX  in  our  2N  dimensional  extended  phase 

space,  and  the  equations  of  the  extremals  of  the  constrained  problem 
of  Type  1  of  the  calculus  of  nations  which  is  furnished  by  this 
integral  and  the  constraint  ://(Z)  -  0  are 

dxL  =  =  AdT,  j.  k  =  1 . N. 

^P] 

All  that  remains,  then,  in  order  to  justify  our  statement  that  \j/(Z) 
is  a  parametric  Hamiltonian  function  of  the  calculus  of  variations 
problem  of  Type  1  ,  whose  parametric  integrand  is  F  ,  is  to 
show  that  Pr  -  A  along  any  extremal  of  this  problem  or, 


equivalently,  to  show  that  Fx  -  A  y  x  On  differentiating 

the  equation  \ff[ Z)  -  0  with  respect  to  X,  (not  P)  being 
held  constant,  we  obtain  l//x  ♦  V^p*p*x  =  0  or>  equivalently, 

Ail/V  +  X*  P*  -0.  Since  F  -  P  X  -  X*  P*  ,  Fv  -  X*  P*  , 

r  x  r  x  r  r  XT* 

so  that  Fx  -  • 

Example.  We  consider  the  case  of  a  conservative  mechanical 
system  with  n  degrees  of  freedom  for  which  T  -  x*t  G  , 

where  the  n  dimensional  symmetric  matrix  G  is  a  function 

of  x  alone,  not  involving  t  ,  and  V  -  V  (x)  is  also  a  function 
of  x  alone.  Then  L  =  j  x’(  G  x(  V  ,  p  -  x*{  G ,  x(  -  G  1  p*  , 

it  being  understood  that  G  is  nonsingular  .  Thus  T  -  |  p  G  1  p* 

and  H  (x,  p)  -  ^  p  G  1  p*  ♦  V  .  Our  (2N-1)  -  dimensional  point 

locus  in  the  2N  -  dimensional  extended  phase  space  has  the 

equation  Pi  ♦  \  P  G  1  p*  ♦  V  -  0  and,  since  H(  -  0  ,  we  know 

that  along  any  gi’-en  extremal  cur  e  of  our  calculus  of  variations 
problem,  H  has  a  constant  value,  E  say,  where  E,  the  energy 
of  the  conservative  mechanica J  system,  varies,  natural!),  from 
one  extremal  cun-e  to  anothe*-.  In  order  to  discuss  an  extremal 
curve  for  which  the  value  of  E  is  given}we  may  write  i  p  G  *  p* 

+  V  -  E  0,  since  Pj  -H,  and  treat  the  left-hand  side 
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U  -hi.,  equation  as  a  parametric  Hamiltonian  function  m  a 
alcuius  ot  variations  problem  whose  extended  phase  .pace  is 
2N  -  :  dimensional,  rather  than  2N  -  dimensional,  any  point 
i  -he  new  extended  phase  space  being  furnished  by  a  “n  x  1 

matrix  *  ,  ,  rather  than  a  2N  x  1  matrLX  |  p  ,  ■  When  *e 

proceed  in  this  way  we  say  that  we  nave  ignored  the  time 
coordinate,  and  we  may  similarly  ignore  any  or  all  coordinates 
which  do  not  appear  in  the  Hamiltonian  function  H  (X,  p) ,  the 
corresponding  constant  momentum,  or  momenta,  playing  the 

role  of  the  energy  constant  E  . 

We  now  ask, what  is  the  parametric  integrand  of  the  calculus 

of  variation  problem  of  Type  1  of  which  \  p  G  1  P*  ♦  V  -  E  is 

a  parametric  Hamiltonian  function?  Denoting  this  function  by 

'  I  X  \ 

\plx  j  we  write  down  the  n  ♦  1  equations  A^p  x-r 


\  G  p*  x 


■  P  G'1  p*  ♦  V  -  E  =  0 


r  ,u  n*  -  G  x  and  we  obtain,  on 

The  first  n  of  these  yield  p  ^  u  y 

2  1 

substituting  this  in  the  last  equation,  A  ■=  2lE-VT *V  °  *T  ' 
Hence 

f  p x_  -  JztE-v)  ‘  !frc'r 

A  ' 


1/2 
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Thus  those  extremal  curves  of  our  original  calcul 
variations  problem  for  which  the  value  E  of  the  energy 
H  =  T  ♦  V  is  given  are  the  extremal  curves  of  a  new  calculus 

of  variations  problem  whose  parametric  mtegrand  is  the 
function  F  just  written.  If  we  assign  to  the  coordinate  space 
of  our  mechanical  system  the  metric  which  is  defined  by  the 

formula  (d  s)2  -  d  x*  G  d  x  ,  then  F  and  I  r  J ^  T  d  r  appeal 

in  the  form  S2  . 

P-  [2,E-V)}''2st:  I-  /  {*<■-*»}  dS 


and  our  new  calculus  of  variations  problem  is  closely  .  elated 
to  the  arc-length  problem  in  n  -  dimensional  space.  If  the 
mechanical  system  were  free  from  applied  force,  so  that  V  is 


constant, 


our  new  calculus  of  variations  problem  is  precisely 


the  arc-length  problem  in  n  -  dimensional  space,  and  the  paths 
of  the  mechanical  system  would  be  the  geodesics  in  the 
n  -  dimensional  coordinate  space,  always  on  the  understanding 
that  this  space  is  assigned  the  metric  which  is  defined  by  the 

formula  (d  s)2  -  (d  x)*  G  d  x  .  When  V  is  not  constant,  the 
paths  of  the  mechanical  system  are  extremal  curves  related 


to  the  integral 


I  /  n  d  s  ; 


n  = 


1  2 
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Integrals  such  as  this  occur  in  the  theory  of  the  propagation 
of  light  through  a  refracting  isotropic  medium,  n  being  known 
as  the  index  of  refraction.  Thus  the  problem  of  determining 
those  paths  of  a  conservative  mechan'.cal  system  for  which  the 
energy  has  an  assigned  constant  value  E  is  the  same  as  that 
of  determining  the  rays  along  which  light  is  propagated  in  a 

1  2 

medium  whose  index  of  refraction  is  proportional  to  (E-V) 
the  metric  assigned  to  the  medium  being  furnished  by  the 

formula  d  s2  -  d  x*  G  dx  ,  where  G  is  the  matrix  of  the 
homogeneous  quadratic  form  in  x^  which  furnishes  the  kinetic 

energy  of  the  conservative  mechanical  system  This  connection 
between  mechanics  and  optics  was  pointed  out  more  than  two 
centuries  ago  by  Maupertuis,  and  is  know.i  as  the  Principle  ot 
Maupertuis. 

We  now  consider  briefly  the  propagation  of  light  in  a 
refracting  isotropic  medium  whose  index  of  refraction,  n  n  (x) , 
is  a  function  of  only  one  of  the  three  space  coordinates;  the 
adjective  isotropic  means  that  n  =  n  (x)  does  not  depend  on  xg  , 

and  when,  in  addition,  n  involves  only  one  of  the  three  space 
coordinates  ,  we  refer  to  the  medium  as  stratified.  If  we  are 

using  rectangular  Cartesian  coordinates  (x,  y,  z)  ,  and  if 
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n  =  n(z)  does  not  involve  the  coordinates  x  and  y  ,  we  say  that 
the  medium  is  stratified  in  planes  perpendicular  to  the  z  -  axis, 
while,  if  we  are  using  space  polar  coordinates  (r, 
and  n  =  n  (r)  does  not  involve  the  coordinates  6  and  (p  ,  we 
say  that  the  medium  is  stratified  in  concentric  spheres  centered 
at  the  origin.  This  latter  case  is  of  importance  in  the  discussion 
of  the  propagation  of  earthquake  waves  through  the  interior  of 
the  earth  (the  index  of  refracMon  n  being  a  constant  multiple 
of  the  reciprocal  of  the  velocity  of  propagation  of  the  waves). 
Whether  the  problem  is  one  dealing  with  the  propagation  of  light 
or  of  elastic  waves,  the  integral  I  -  J  n  d  s  is  a  constant 
multiple  of  the  time  taken  for  the  disturbance  to  pass  from  one 
point  of  the  medium  to  another,  so  that  the  theory  is  governed, 
in  each  case,  by  Fermat’s  Principle  of  Least  Time. 

Treating  first  the  case  of  a  plane  stratified  medium,  the 
parametric  momentum  matrix  is  furnished  by  the  formulas 

px  =nxs;  p2  =  n  ys ;  P3  -  "  *8J  3nd  p?  *  P2  *  P3  =  *  ‘ 

The  coordinates  x  and  y  are  ignorable,  P,  and  P2  being 

constants  along  any  extremal  Set'ing  P^  =  Cj  ,  Pj  c2- 

have  n2  zj  =  (n2  -  cj  c2)  ,  so  that,  along  any  extremal,  s 
is  furnished,  in  terms  of  z  ,  by  the  quadrature, 
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,  2 
(n  -c 


n  d  z 
2 


1 


2.1  2 

c.J 


It  is  clear,  since  y  -p- 

X  i  i 


if  Cj  *  0,  and  x^ 


c 

c 


1 

2 


if  c 2  *  0,  that,  if  not  both  Cj  and  c 2  are  zero,  the  corresponding 

extremal  lies  in  a  plane  parallel  to  the  z  -  axis, and  there  is  no 
loss  of  generality  in  taking  this  plane  to  be  the  plane  y  =  0,  so 
that  c2  =  0  ;  when  we  do  this  we  write  c  instead  of  Cj, so  that 
z  n  d  z 

s=  f  —X - T~r^>  al°nK  our  extremal.  If  both  Cj  and  c2 

J  (n2  -  c2)1 

are  zero,  the  extremal  is  a  straight  line  segment  parallel  to  the 

z  -  axis.  Note:  The  reason  that  wo  can  ob’ain  the  extremals 

without  integrating  the  one  nontrivial  Euler- Lagrange  equation 

(n  z  )  =  n  is  that  our  calculus  of  variations  problem  is 

s  s  z 

1  2  2  2  2 

presented  in  parametric  form;  the  function  l^/(Z)  2  (Pj  +  ^*2  +  ^3  11  ^ 

is  a  parametric  Hamiltonian  func'ion  and  the  combination  of  the 
two  trivial  Euler- Lagrange  equations  (n  x^-0,  (n  ys)s  °» 

and  the  relation  \jr  i Z)  -  0  may  be  used  'o  replace  ’he  three  Euler- 
Lagrange  equation^  once  the  constan'  values  of  P j  n  x  and 

p„  n  v  along  anv  extremal  are  given. 

2  ■  s 
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2  2  2  2  2 

It  follows  from  the  relation  n  zs  =  n  -  Cj  -  c2  that 

n2  (1  -  z2)  +  cl,  so  that  n(l-z2)1  2  is  constant  along 

any  extremal;  this  result,  which  states  that  the  sine  of  the  angle, 
at  any  point  of  the  extremal,  between  the  extremal  and  the 
z-axis  is  inversely  proportional  to  the  index  of  refraction  at 
that  point,  is  known  as  Snell's  Law  of  Refraction.  If  the  index 
of  refraction  is  discontinuous  at  z  =  Zj  (a  finite  number  of 

such  discontinuities  being  allowed  by  the  theory)  the  extremals 
have  corners  at  a  =  a,  .but  Pj  >  n  (1  -  a^)1'2  is  continuous 
at  z  =  Zj  . 

The  velocity  of  a  mass  particle  falling  under  gravity  is 

proportional  to  z1/2  ,  where  z  is  the  depth  below  the  level 
of  zero  velocity,  and  so  the  brachistochrone  problem,  i.  e. ,  the 
problem  of  determining  the  curve  connecting  two  points  along 
which  the  time  of  descent  is  a  minimum,  is  the  same  as  that 
of  determining  the  curves  along  which  light  is  propagated  in  an 
isotropic  medium  stratified  in  planes  perpendicular  to 
the  z  -  axis,  the  index  of  refraction  being  proportional  to 

z-1/2  _  The  velocity  v  of  the  mass  particle  is  furnished 
by  the  formula  v2  =  2  g  z  ,  where  g  is  the  acceleration  due  to 
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•ss*. 


gravity,  and  so  we  set  n  [2  u  z)  Ass'  nnng  that  our 

extremal  is  not  a  line  sigmen’  parallel  ’.<■  ihe  z  -  axis  and 
that  it  lies  In  the  (z  x)  •  plane,  uc  have 


0  (1  -2gc2z)1  2 


,  •,  „  2  1  2}  _ 

1  •  .1  2ge  z  :  c  <0,  sv 


s  being  measured  from  the  level  of  zero  velocity  z  0  The 

1 

maximum  value  of  z  is  — 9  ,and  '‘■hen  z  reaches  this  value 

2gc 

it  begins  to  decrease, si  that  s^  is  negative  and  is  furnished  by 

2  “12 

the  formula  sz  ;  -  fl-2gc  z'*  ,  ra’her  than  by  the  formula 


9  _  _  1 

s  =  (l-2gc  z)  2  .  We  den'  *p  - by  2  a  ,  so  that  the 

Z  2gc 

length  of  the  descending  par*  of  'he  curve,  from  z  -  0  to 
z  =  2  a  ,  is  — ^  -4a.  On  denoting  by  0  the  angle  made  by 


.  o  2  .1/2 

the  curve  with  the  z  -  axis  ,we  have  cos (p-  zs  -  *  <l-2gc  z)  , 


0  varying  from  0  to  *  over  the  descending  pa rt  and  from 


^  to  7t  over  the  ascending  par*  f  the  rur'-e  Thus  s  la(l-cosi^) 

over  both  the  descending  and  the  ascending  part  of  the  curve, 
and  z0-  zs  s0  si n0c<  s0  s  'hat  z  a  .T  cos  20 )  , 
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the  constant  of  Integration  being  determined  by  the  fact  that  z  =  0 
«hen  s,  or,  equivalently  C^isO.  Finally,  x  =  sin0,  so  that 

x(pm  xs80  *  4  a  sln20  and  X  =  xQ  4  a  (2  0  -  sin  2  0 ) ,  where 

Xq  is  the  value  of  x  when  0=0.  Thus  the  parametric  equations 

of  any  extremal  curve  other  than  a  line  segment  parallel  to  the 
z  -  axis,  are 

z  -  a  (1-cos  20)  ;  x  =  xQ  +  a  (20  -  sin  20)  ;  0<  0  <  n 

These  extremal  curves  are  cycloids  traced  by  a  point  on  the 
circumference  of  a  circle  of  radius  a  rolling  on  and  below  the 
level  of  zero  velocity.  This  family  of  extremal  curves  is  a 
2  -  parameter  family,  the  two  parameters  being  a  and  xQ ,  and 

it  can  be  shown,  by  a  detailed  discussion  into  whirh  we  do  not  enter, 
that  there  passes  through  any  two  points  whose  z  -  coordinates 
are  nonnegative  and  whose  x  -  coordinates  are  different,  one,  and 
only  one,  member  of  the  family.  Furthermore,  if  we  fix  the  value 
of  xQ,  the  one  parameter  family  so  obtained  simply  covers  the 

region  z  >  0,  x  >  x^  of  the  (z,  x)  -  plane  and  furnishes  an 

extremal  field  over  this  region.  The  problem  of  Type  1  of  the 
calculus  of  variations  which  is  furnished  by  the  integral 
I  =  /nds,  n>0  is,  since  the  arc -length  problem  is  positively 
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v  ■ 


regular,  itself  positively  regular,  and  so  we  know  that  if  our  two 
given  points  lie  below  the  level  of  zero  velocity  and  do  not  have 
the  same  x  -  coordinate,  the  one  and  only  cycloid  of  our  2  -  parameter 
family  which  connects  them  furnishes  an  absolute  minimum  of  the 
integral  I.  The  excluded  cases,  namely,  those  when  one  or  both 
of  the  two  given  points  lie  on  the  level  of  zero  velocity  and  when 
both  have  the  same  x  -  coordinate, are  easily  cared  for  (the  first 
by  an  argument  involving  considerations  of  continuity  and  the 
second  by  using  as  our  extremal  field  the  1  -  parameter  family 
of  straight  lines  parallel  to  the  z  -  axis) . 

For  a  spherically  stratified  medium  the  parametric 
momentum  matrix  is  Pj  =  n  rg ,  -  n  r2  #g ,  Pg  *  n  r2  sin2#0g  , 


the  parameter  adopted  being  the  arc -length,  and  the  second  and 
third  of  these  are  constant  along  any  extremal  n  r2  0g  =  Cg , 

n  r2  sin 20  (/>  =  c,  .  There  is  no  loss  of  generality  in  taking 

c,  -  0 ,  as  this  can  be  arranged  by  properly  choosing  the  space 

w 


polar  coordinate  reference  frame.  Thus  the  extremal  curves  are 
plane  curves,and  we  may  confine  ctar  attention  to  plane  polar 
coordinates  (r,  Q)  writing,  simply,  nr2  c  . 


S<P12*TP2'"2> 


is  a  parametric  Hamiltonian  function,  and,  since  Pg  =  c  ,  it 
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is  constant  along  any  extremal. 


follows  that  n  r  (1  -  r^)1^ 

s 

Thus  the  product  by  r  of  the  sine  of  the  angle  which  the  extremal 
makes,  at  any  of  its  points,  with  the  radius  vector  is  inversely 
proportional  to  the  index  of  refraction  at  that  point,  this  being 
the  analogue,  for  a  spherically  stratified  medium,  of  Snell's  Law 

of  Refraction  for  a  plane  stratified  medium.  The  relation 

2  2 
2  c  2  2  2  c  2 

Pj  +  ~2  =  n  yields  n  rg  +  -j-  =  n  ,  so  that,  along  any  extremal, 

r  r 

f  r  . 

s  =  J  2  2 - TY/2  *  9  being  measured  r  *  rQ  •  For 

rQ  (n  r  -  c  )  ' 

example,  if  n  is  inversely  proportional  to  r  ,  s  is  a  constant 
times  r  -  rQ  along  any  extremal.  For  any  given  value  of  c  , 

nr)c  along  the  corresponding  extremal;  for  example,  if 
-1/2 

n  =  (2gr)  ,  the  nearest  point  of  the  extremal  to  the  center  of 

2 

the  spherically  stratified  medium  has  a  distance  =  2  gc  from 
this  center. 


Ill 


Lectures  on  Applied  Mathematics 
The  Calculus  of  Varia*ior.s 
Lecture  12 

Problems  Whose  Lagrangian  Functions  Involve  Derivatives 
of  Higher  Order  than  the  First 


We  shall  consider  in  this  lecture  problems  of  Type  1  of 
the  calculus  of  variations  whose  Lagrangian  functions  involve 
derivatives  of  higher  order  ‘han  the  firs*  Confining  our  attention, 
at  the  beginning,  to  the  case  where  L  involves  sec  md-order 
derivatives,  bu»  r.o  derivatives  o'  order  higher  than  the  second, 

L  is  a  function  of  the  following  *hree  ma*nces 


1)  The  N  x  1 

2)  The  n  x  1 

3)  The  n  x  1 


time  -  coordinate  matrix 
vel  .city  matrix  x^ . 
acceleration  matrix  x,,  • 


X  * 


We  denote  by  z  the  (3  n  +  1)  x  1  matrix 


rather  than, 


as  before,  the  (2  n  4  1)  x  1  matrix  (*)  .  Our  state  space  is 

now  (3  n  ♦  1)  -  dimensional  rather  than  (2ntl)  -  dimensional, 
and  the  (2  n  +  1)  ■  dimensional  space  whose  points  are  furnished 
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by  the  (2  n  ♦  1)  x  1  matrices  |  *  |  ,  i.  e. ,  our  previous  state 

space,  will  now  play  the  role  previously  played  by  the  N  -  dimensional 
time-coordinate  space  By  this  we  mean  that  the  various  curves 
x  =  x  (t) ,  a  <  t  ^  b  in  our  time -coordinate  space  will  not  only 

have  their  end-points  A=  | x ^a ) )  and  B"  x  (b) )  Prescribed’ 
but  also  the  end  values  x((a)  and  xt{b)  of  the  velocity  matrix 

will  be  prescribed  It  no  longer  suffices  that  these  curves  be 
piecewise  smooth,since  the  acceleration  matrix  x^  must  be 
defined  along  them,  except,  possibly,  at  a  finite  number  of  points. 

We  say  that  a  curve  x  =  x  (t) ,  a  <  t  <  b  in  our  time-coordinate 
space  has  piecewise-continuous  curvature  if  it  possesses  the  following 
two  properties: 

1)  It  is  smooth,  i.  e. ,  x{  exists  and  is  continuous  over  a  <  t  <  t 

2)  The  interval  a  t  <  b  may  be  covered  by  a  finite  net  over 
each  cell  of  which  xu  exists  and  is  continuous  If  t'  is  an 
interior  point  of  the  net  the  right-hand  and  left-hand  acceleration 
matrices  at  t' ,  xft  (t*  ♦  0)  and  xM  (f  -  0),  exist,  but  they  need 

not  be  equal.  If  they  are  equal  xu  exists  and  is  continuous  at  t', 

and  the  point  t'  may  be  removed  from  the  net 

Each  point  of  a  curve  in  our  time -coordinate  space  possessing 
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J 


piecewise-continuous  curvature 


point  z  =  x 


tt 


of  our  new  (3 


at  which  x((  exists  .defines  a 
n  ♦  1)  -  dimensional  state  space. 


and  we  confine  our  attention  to  those  curves  whose  images  in  this 
state  space  are  covered  by  a  region  D  of  this  space  over  which 
L  -  L  (z)  is,  by  hypothesis,  a  continuously  differentiable  function 
of  the  (3n+l)xl  matrix  z  If  we  wish  to  present  our  problem 
parametrically^  we  must  assume  that  the  function  t  -  t  (T) ,  <*  T ^  6, 
which  introduces  thq  new  parameter  T  ,  is  smooth  over  a  <  T<  B 
and,  in  addition,  that  this  interval  may  be  covered  by  a  finite  net, 
over  each  cell  of  which  t  exists  and  is  continue  as.  Then 


the  curve  X-X(T),  a  <  T  ^  fi ,  possesses  piecewise-continuous 
curvature.and  the  parametric  integrand  F  (X,  X  — ,  *S 

defined  by  the  formula. 


F  (X.  X, 


xrr» 


L  (X, 


■t- 


V'r 


XJL 

'T 


tt 


IT 


Vu 


■T 


In  order  to  obtain  the  Euler- Lagrange  equation  we  imbed 
any  curve  C  possessing  piecewise-continuous  curvature  and 
whose  image  in  our  (3  n  *  1)  -  dimensional  state  space  is  covered  by  D, 
in  a  1  parameter  family  of  curves  C,  all  possessing  piecewise- 
continuous  curvature  and  all  of  whose  images  are  covered  by  D, 
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as  follows  If  C  is  furnished  by  the  formula 


x-x(T),  hTs:b 


we  write 


X=xr)*sfiT);  - 

where  the  N  x  1  matrix  function  f  (T)  possesses  the  following 
two  properties' 

0  f  (T)  possesses,  over  a  ^  7\  B ,  a  piecewise-continuous 
second  derivative. 

2)  Both  f  (T)  and  its  first  derivative,  ^.(T),  vanish  at 
T  =  a  and  at  T  =  6  . 

These  properties  ensure  that  all  our  comparison  curves  C 

possess  piecewise-continuous  curvature  and,  moreover,  that 

X  (or)  =  X  (a) ,  Xr(a)  --  X  'a) ,  X  (B)  =  X  B) ,  X^B)  -  X^B), 
so  that  all  four  N  x  1  matrices  X  (a) ,  X^,(ar) ,  X  (B) ,  V* 
are  independent  of  s  Since  X^.  -  X^.  *  s  ,  X^,^/  S  ’ 


we  have  6  X_  =  (6  X)  , 

T  T 

also,  6  X^.^,-  (6  Xy.)  ■ 


as  in  our  previous  discussion,  and, 

If  6  is  taken  sufficiently  small,  all 


the  curves  C  of  our  1  -  parameter  family  of  comparison 
curves  are  such  that  their  images  in  our  (3n  *  1>  -  dimensional 
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state-space  are  covered  by  the  region  D.over  which  L  =  L  (z) 
is,  by  hypothesis,  a  continuously  differentiable  function  of  z  or, 
equivalently,  over  which  F  (X,  X  ,  is  a  continuously 

/X  \ 

differentiable  function  of  the  3  N  x  1  matrix  J  X^  J.  The  integral, 

lrr) 

ft 

I  =  J  F(X,  X^,X^7.)d  T,  of  F  along  any  one  of  the  curves 


C  is  a  differentiable  function  of  s  and 
6 

(FV6X*F  6X.  +F 

t  r  rr 

As  in  our  previous  discussion  we  may  integrate  the  matrix  product 


6  I 


■  /V-x‘V'x  -rr’- 


F„6  X  by  parts  .obtaining 

A 


J  (Fx  6  X)  d T=  -  J  |c(6X)r|dT  .where  G(T)=  /  Fx(u)du  , 
6  X  being  zero  at  T  a  and  at  T  -  6  .  Since  (6X)^_  =  6X^, ,  we  have 


6  I  - 


l  |(Fxr  '  Gr  xr  *  rxTT  6Xrrl dT 

and  we  now  integrate  by  parts  the  matrix  product,  |^X^,  ® 

Setting  H  ( r )  -  J  (FX^  -  G  (ulj  d  u  ,  we  have 
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OVGi5MdT  =  -riH(5VrldT 


since  6  X_  is  zero  at  T  =  a  and  at  T  =  B.  It  follows,  since 

T 


(6  X^)  =  6  X  .that 

tv  rr 


■n 


61=  |  |/Fx^^-H(r)|6XrT 


dT 


and  it  is  easy  to  see  that  6  I  will  be  zero  for  all  allowable 
choices  of  6  X  =dsf  ,  ifFY  -  H  (7”)  is  a  linear 

rr  r  t  *rr 


1  X  N  matrix  function  of  T  .  Indeed,  on  writing  for  our  later 
convenience,  any  linear  1  x  N  matrix  function  cr  T  in  the  form 
(T  -  a)  c  +  d,  where  c  and  d  are  constant  1  x  N  matrices, 


we 


(T  -  a)  c  +  d  j 


N 


have  {(T-a)  c  +  d  6Xr-=  £ 


rr 


H 


(T 


•°lc,*di| 


6  X*  and 

TT 


fi  i  fi  r  B  i 

1)  J  (T-«)cJtX^lT.cJ(r-«)»^.  [  -Cj  |  »x|dT 


i  8  i 

Cj(r-0f)6XJr  -Cj6XJ 


which  is  zero, since  both  6  Xj  and  6  X*,  are  zero  at  T=  a  and 
at  r=  B  ;  and 
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2>  [Bvxt/M5Xt 


=  0 


Note.'  In  deriving  these  results  we  have  made  use  of  the  relations 

6xjrr:(64V  ’  6Xr  =  (6xi)r  • 

Conversely,  if  6  I  is  zero  for  all  allowable  choices  of  6  X^,  , 

the  1  x  N  matrix  Fx  -  H (T)  must  be  a  linear  function 

rr 

of  T  .  To  see  this, we  introduce  the  1  x  N  matrix  function  of  T  : 


K(T) 


lrr 


(v)  -  H  (v) )  d  v 


du -\{T-  a)2  c  -  g(T- 


a)3  d 


where  c  and  d  are  constant  1  x  N  matrices  which  we  shall 
shortly  determine.  K  (T)  is  continuously  differentiable  over  the 
interval  a  <  T  <  B  .  K  (T)  being  furnished  by  the  formula: 


Kr  (T) 


(v)  -  H  (v))  d  v  -  ( T-  «)  c 


-^<r-„)2d 


Thus  both  of  the  lxN  matrices  K(T)  and  K^.(T)  are  zero  at 
T  -  a  ,  no  matter  what  are  the  constant  1  x  N  matrices  c  and  d, 
and  we  now  determine  these  constant  1  x  N  matrices  so  that  both  of 
the  1  x  N  matrices  K  (T)  and  K  (T)  are  also  zero  at  T  =  B. 
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This  will  be  the  case  if,  and  only  if,  c  and  d.  ,  j  =  1,  . . .  f  N  , 
have  the  values  which  are  unambiguously  determined  by  the  two 


linear  equations 


\  <6  -  a)2  c,  *  5  (6  -  a)3  dj  = 


Fxj  (v)  -  H,(v)l 

rr  ' 


d  u 


(B  ■  «)  c,  *  \  (B  -  a)2  j  FX>rrM  -  HjW}  *  ’  ' 

K  (T)  has,  over  the  interval  a  <  T<  fl  ,  a  piecewise-continuous 
second  derivative,  K  <r>  •  F„  <T>  -  H  (T)  -  c  -  (T-  or)  d, 

rr  xrr 

and  so  we  may  take  for  our  Nxl  matrix  f  (T)  the  transpose 
K^(T)  of  the  1  x  N  matrix  K(T):  when  we  do  this,  6  I,  which 
may  be  written  in  the  form 


r6 

6^  / 

(fx  -  H  <r)  •  c  -  <r-  o)d(  5  X 

J a 

.1  rr  *  J 

appears  as  the  product  of  the  integral,  over  the  interval  a  <  T<  8 , 
of  the  squared  magnitude  of  the  1  x  N  matrix, 

F  -  H  (  T)  -  c  -  (T  -  <*)  d,  by  d  s  .  Hence,  for  6  I  to  be  zero, 

*TT 

this  1  x  N  matrix  must  be  zero  at  all  its  points  of  continuity, 
in  other  words,  Fx  -  H  (T )  =  c  *  (T  -  a)  d  at  all  the  points  of 

TT 
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the  curve  X  =  X  (T) ,  a  <  fi  ,  at  which  X  is  defined  (and, 

hence,  by  hypothesis  continuous),  c  and  d  being  unambiguously 
determinate  constant  lxN  matrices.  Since  H  (T)  is  continuous 

over  a  <  fi,  it  follows  that  if  T' ,  say,  is  any  one  of  the 
finite  number  of  points  of  this  interval  at  which  X  is,  possibly, 

TT 


undefined,  Fv  ( T '  -  0)  -  F  (f'  +  O).  Since 

xrr  rr 

I  x  •  ^  ^ 

F  =  L  tr  and  x„  =  -  — -- — -T,  the  first  element  of  F^  r 


lr 


'tt 


t* 


r 


is  the  negative  of  the  quotient  of  the  matrix  product,  L  x.  by  t  , 

tt  T 


while  the  remaining  n  elements  of  Fv  are  those  of 

xrr 

—  l  Hence  since  t  is,  by  hypothesis,  continuous  over  the 

tr  xtt  r 


interval  oi<T<fi,  we  have 


L  (f  -  0)  =  L  (t'  +  0)  ;  t'  =  t  (T’> 
xtt  Xtt 

If  the  n  -  dimensional  matrix  L  , *  exists  and  is  continuous 

xttxtt 

and  nonsingular  over  the  region  D  of  our  (3  n  +  1)  -  dimensional 
state-space,  xt(  is  unambiguously  determined  along  the  curve 

x  =  x  (t) ,  a  <  t  <  b  by  L  and  so  x  (t>  -  0)  =  x  (f  +  0) ; 

A.  . 
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3 


in  other  words,  x^  is  not  merely  piecewise  continuous, but  actually 

continuous  over  the  interval  a  <  t  <  b . 

At  any  point  of  the  curve  X  -  X  (T) ,  a  <  T<  B,  at  which 

X_,_  is  defined,  H  (7)  is  differentiable  with  the  derivative 
TT 

H  (T)  -  F_  -0(7*),  and  so  we  obtain  from  the  relation 

r  *r 

F_  -  H  (r)  -  c  +  (T-  «)  d,  the  existence,  over  a  <T<  B, 

rr 


of  |fx  |  ,  with  the  value  Fx  -G(T)  +  d.  Moreover,  at 

all  such  points,  G  (T)  is  differentiable  with  the  derivative 

G_.tr)  -F„.  If  the  n  -  dimensional  matrix  L  *  exists 
7  a  xttxtt 

and  is  not  only  continuous  and  nonsingular  over  D  but  also 
continuously  differentiable  over  D ,  x^  exists  and  is  continuous 

over  the  interval  a  <  t  <  b,  and  so  |  Fv  exists  and  is 

1  xr/r 

continuous  over  a  <  T<  A  (it  being  necessary  to  assume,  at 
this  point,  that  t _ _  exists  and  is  continuous  over  a  <  T<  6) . 

rrr 

Hence  /  Fv  1  exists  over  a  <  7<  B  with  the  value 

I  rrlrr 

( FXr )  ' ■  Fx  >  80  that 


<*<  r<  B 
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This  is  the  Euler -Lagrange  equation  for  a  problem  of  Type  1  of 
the  calculus  of  variations  whose  Lagrangian  function  involves  V 


When  we  use  the  Lagrangian  parameter  t ,  this  Euler- Lagrange 
equation  appears  as 


tt 


a  <  t  <  b 


It  is  clear  from  the  argument  by  which  it  was  derived  that  if  L 
involved,  in  addition  to  the  matrices  X,  x^.and  the 


matrix  x^  ,  the  corresponding  Euler- Lagrange  equation  would  be 


L  1  -  L  *  L  -  L  0;  a  <  t<  b 

xttt]ttt  Xttjtt  Xt]t  X 


and  so  on. 

Example,  n  =  1,  L  =  ^  d  xft  *  |  ^e2xf  *  2  elxt^  +  \  ^ 2 ^  +  2  ^lx^  ’ 
the  coefficients  d,  e2,ej,  f2,and  being  given  functions  of  t. 

L  =  d  x  .  and  the  matrix  L,*  ,  here  1  -  dimensional,  is 

xtt  1  *tt  tt 

the  coefficient  d.  We  assume,  then,  that  d  is  continuously 

differentiable  over  a  <  t  <  b,  and  we  know  that  our  problem  does 

not  possess  extremals  unless  d  x^  possesses  a  continuous  second 

derivative  with  respect  to  t  along  them.  We  assume,  therefore, 
that  not  only  is  d  continuously  differentiable  over  a  ^  t  <  b  but  that 
it  possesses  a  continuous  second  derivative  over  this  interval. 
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Similarly,  since  *  ®2  xt  *  el  *  *V  *  *2  x  *  *1 »  "*  M*U"le 
that  e,  and  e.  are  continuously  differentiable ,  and  that  f2 
and  are  continuous  over  a<  t<  b.  Our  Euler- Lagrange 
equation: 

is  an  ordinary  Unear  differential  equation  of  the  fourth  order. 
Writing  i  (t)  *  x  (t)  ♦  y  (t) ,  a  <  t  <  b .  where  both  y  (t)  and 
yt(t)  are  zero  at  t  *  a  and  at  t  =  b,  we  find 

I  (C)  -  I  (C)  •-  J^Jd  (xw  yM  *  \  yM2)  ♦  e2  <xt  yt  +  5  yt2) 

♦  ej  yt  /2  (xy  ♦|y2>+  fj  y|  d  1 

A  repeated  integration  by  parts  yields,  since  both  y  (t)  and  yt  (t) 


are  zero  at  t  -  a  and  at  t  -  b , 

jb  dxttyn<it  “  -  /b(dvi  ytdt'  r,dv«yd' 

and,  since  f"  (e  x,  *  e,)y,  dt  .  -  j“  <*.,  x,  y  * . 

Ja  a 


1(C)  = 


•  i  {<dx>t»u 


-(e2n,.e1), 


d  t 
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Since  C  is,  by  hypothesis,  an  extremal  curve  of  our  problem, 
the  first  of  the  two  integrals  on  the  right  is  zero,  since  its  integrand 
is  zero,  and  so 

I  (£)  -  I  (C)  =  \  J  (dy2  +  e2  yt2  +  f2  y2)  d  t 

Thus,  if  d,  e2,and  /  are  nonnegative  over  a<t<b,  I  (C)  ^  I  (C) . 

If  d  is  positive  over  a  <  t  <  b ,  the  weak  inequality  I  (C)  >  I  (C) 
may  be  replaced  by  the  strong  inequality  I  (C)  >  I  (C)  unless  ytt 

is  zero  at  all  its  points  of  continuity;  if  this  were  the  case  y  Would 
be  zero  over  a<t^b,  since  both  y  and  yt  are  zero  at  t=a. 
Hence,  the  relations  d>0,  ej  >  0 ,  f2  >  0  over  a  <  t  <  b  are 

sufficient  to  ensure  that  I  (C)  >  I  (C),and  consequently, that  our 
extremal  curve  is  a  minimal  curve,and  moreover, that  it  is  the  one 
and  only  extremal  curve  for  which  x  (t)  and  x^(t)  have  prescribed 


values  at  t  =  a  and  at  t  =  b 

When  presented  parametrically,  our  integrand  function 
F  (X,  X^. ,  ^Y'f)  is  such  that  F  (X,  kX^  ,  k2  X.^)  =  k  F 
k  >  0 ,  and  we  obtain  on  differentiating  this  relation  with  respect  to  k 
and  then  setting  k  *  It 


P  X  *  2  Q  X  -  F 

T  TT 


P  *  F  ,  Q  =  F 

r  rr 
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*r  Qi 

On  denoting  by  q,  we  have  Qj  -  q  ,  Q2  “  t^T  * 

O  «  ,  go  that  Q  X_  *  0 .  When  the  end-points  of  our  curves 

t^  T 


may  vary ,  we  have 


SI-jlP-Oj-ISX-QS  xt||a  *  ja  |(FX  -  Py  rx^)  8  xjdf 

so  that,  when  the  curve  of  integration  is  an  extremal  curve, 


*  {< p  •  V 


6  X  +  Q  6 


The  Hilbert  invariant  integral  is 


I*  =  j|(P  -Qr)dX^QdXrj 

where  the  two  Nxl  matrix  field- functions  ♦-  V ,  U,  are 
substituted  for  X  and  Xfrr  .  respectively,  in  the 

coefficients  P  -  and  Q .  When  the  curve  of  integration  is  an 

arc  of  a  member  of  our  extremal  field  (  now  a  2  n  -  parameter  field  ) 
I*  has  the  same  value  as  I .  Indeed,  since  QXj.  =  0,  F  -  P  *  2  Q 


may  be  written  as  (P  -  Q^.  )  X^  Q  X^.^, 


t  so  that 


As  an  example,  let  us  consider  the  problem  for  which 
L  *  (x^  -  k*  x2) ,  k  a  positive  constant.  The  Euler-Lagrange 

equation  is  xtm  -  k4  x  *  0,  so  that  the  extremal  curves  are  of  the 

form  x  ■  Sj  cos  (k  t  -  6j)  +  s2  cosh  (k  t  -  b^,  where  Sj  ,  s^ ,  6^ , 

6,  are  constants  of  integration.  The  2  -  parameter  family  obtained 

i 

by  setting  =  0  *  ®2  ~  0  i8  an  extremal  fieW  over  ^  8lice 
-l-C<t<l+€  of  our  |  *  J  space  if  the  Jacobian  matrix  of 

s.  cos  kt  +  s9  cosh  kt  \ 

1  1  .  with  respect  to  the  2x1  matrix 

-  kSj  sin  kt  ♦  ks2  slnh  kt  / 

(Sl  I  is  nonsingular  over  this  slice,  i.e. ,  if 
®2  / 

cos  kt  sin  h  kt  +  sin  kt  coshkt  is  positive  over  the  interval 
-l-€<t<l  +  €.  The  least  positive  zero  of  the  equation 
cos  k  sinh  k  +  sin  k  coshk  =  0  plays  the  role  played  by  y  ,  the 

least  positive  zero  of  the  equation  cos  k  =  0 ,  in  the  problem  whose 

Lagrangian  function  was  \  (x  2  -  k2  x2) .  As  in  that  problem,  we 

form  the  Rayleigh  quotients 
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for  functions  x  *  x  (t) ,  -  1  ^  t  ^  1 ,  which  vanish,  together  with 
their  first  derivatives,  at  t  -  -1  and  at  t  -  1 .  Then  the  fourth  power 
of  the  least  positive  zero  of  the  equation  cos  k  sinh  k  >  sin  k  coshk  *  0 
is  the  greatest  lower  bound  of  all  the  Rayleigh  quotients  which  we  obtain 
in  this  way.  In  order  to  justify  this  statement  we  have  to  show  that 

f  1  2  i 

J  (xtt  -  k  x  )  d  t  is  positive  for  all  curves,  other  than  the 

line  segment  *(t)-0,  which  are  such  that  x  (t)  and  5ct  (t)  are 

zero  at  t  -  -1  and  at  t  -  1 ,  provided  that  k  is  any  positive  number 
less  than  the  least  positive  zero  of  the  equation 

cos  k  sinh  k  +  sin  kcosh  k  =  0 .  In  order  to  show  that  I  (C)  >  I  (C)  *  0 , 


we  calculate  the  Weierstrass  E  -  function, 

F  (X,  Xf  ,  )  -  (P  -  Qr )  *r  -  Q  X  .  Since 

t  x  -  x  t 

1  2  4  2  T  TT  T  TT 

F  =  L  t  =  |  (x‘  -  k  x  )  t  and  xft  = - g - ,we  find 

T  c 

(t  x  -  x  t  )  x  5  (t  x  -  x  t  )2 

r  rr  r  rr  rr  r  rr  r  rr  L4  2 

1  V  C  2  0 

T  T 


P2  =  Fx 


(t_.x  -x  t  )  rr 

r  rr  r  rr 


<lrxrr '  Vrr)xr 


i  «2  '  Fx 


t  x_  -  x_  t_ 

r  rr  r  tt 


127 


r 


Using  the  Lagrangian  parameter  t,  t^  =  1,  -  0.  so  that 

pl  =  -  I  -  4  k<  x*  :  P2'°;  VXtt 

and  the  Weierstrass  E  -  function  appears  as 

E  =  i  G*  -  k4  X  2)  .  i  (x2  *  k4  x2)  xt  *m  *  xm  ;t  -  xlt  i„ 

3 

where  xtt  =  -  k2  Sj  cos  kt  k2  s2  cosh  kt ;  xttt  -  k  Sj  sin  kt  +  k  Sj  sinh  kt; 
and  s.  and  s„  are  determined  unambiguously  as  functions  of  x  and 

1  it 

x(  by  means  of  the  equations  x  -  s^  cos  kt  +  s2  cosh  kt, 

xt  -  -  k  Sj  sin  kt  +  k  s2  sinh  kt .  Since  x  =  x ,  xt  -  ^  >  this  reduces  to 

1  *  -2  -  «U  x„  -  2  -  x«'2.  x"d  90  I  ©  -  ■  (C»  -  \\  «  *  >  0. 

the  equality  holding  only  when  x^  -  xu  at  all  the  points  of  C  at  which 
x  exists  (  and  is,  consequently,  continuous).  Since  both  x  -  x  and 
(x  x)t  are  zero  at  t  -  1 ,  it  follows  tliat  I  (C)  -  I  (C)  =  0  only  when 
C  coincides  with  C  .  Hence  I  (C)  >  I  (C)  -  0  when  C  is  different 

from  C  ,  and  it  follows  that  \ ^ f \ dt>k,  where  k  is 

any  positive  number  less  than  the  smallest  positive  zero  k*  of 
cos  k  sinh  k  ♦  sin  k  cosh  k, and  x  (t)  is  any  function  of  t  which 
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possesses  a  piecewise -continuous  second  derivative  over  the  interval 
-  1  <  t  <  1,  and  which  vanishes,  together  with  its  first  derivative, 
at  t  *  -  1  and  at  t  -  1 .  If  we  set  x  *  cosh  k*  cos  k*t  -  cos  k*  cosh  k*t , 
we  have  xu  -  -  k*2  (cosh  k*  cos  k*t  +  cos  k*  cosh  k*t)  ,  and,  since 


r  1  I 

cos  k*t  cosh  k*t  dt  *  —  (cos  k*  sinh  k*  -*•  sin  k*  cosh  k*)  =  0 , 
J-1  k* 

4  4 

the  Rayleigh  quotient  for  this  function  is  k*  .  Hence’  k*  is  the 


greatest  lower  bound  of  all  the  Rayleigh  quotients 


xMl  *t2)2,  is 


63 

2 


.  The  Rayleigh  quotient  for  the  function, 
and  so  the  least  positive  zero  of 


cos  k  sinh  k  *  sin  k  cosh  k  is  <  (31  b)1''4  -  2. 36907.  The  equation 
cos  k  sinh  k  *  sin  k  cosh  k  -  0  may  be  written  in  the  equivalent  form 
cos  2  k  cosh  2  k  -  1  =0  and  the  least  positive  zero  of  cos  2  k  cosh  2  k  -  1 
is  k*  2.36502  to  five  decimal  places  (as  given  by  Rayleigh,  Theory  of 


Sound,  Vol.  1,  p.  223). 

Exercise.  Set  x  -  (1  -  t2)2  (1  tst2)  and  determine  the  minimum 

r1  2  If1- 2 

value  of  the  Rayleigh  quotient,  R  (s>  J  xtt  dt  j  j  ^  x  dt,  thus 


obtaining  a  closer  approximation  to  the  fourth  power  of  the  least 
positive  zero  of  cos  2  k  cosh  2  k  -  1 . 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  13 

Multiple-Integral  Problems  of  the  Calculus  of  Variations 
In  problems  of  Type  1  of  the  calculus  of  variations, we  were 

fb 

concerned  with  integrals  I  -  j  L  dt  extended  along  piecewise- 

a 

smooth  curves  x  -  x  (t) ,  a  <  t  ^  b  in  our  N  -  dimensional  time- 
coordinate  space,  x  being  a  n  x  1  matrix  and  N  being  n  +  1 .  We 
now  turn  to  problems  of  the  calculus  of  variations  in  which  we  are 
concerned  with  integrals  I  -  f  L  d  (x)  extended  over  n-dimensional 

point  loci  in  N  ( -  n  1)  -  dimensional  space,  and  we  refer  to  these 
problems  as  problems  of  Type  2.  Instead  of  n  functions  x  =  x  (t) 
of  a  singlp  independent  variable  or  parameter,  we  now  have  one 
function  u  -  u  fx)  of  n  independent  variables  or  parameters,  the 
elements  of  a  n  x  1  matrix  x  .  As  the  point  x  varies  over  some 
region  plus  its  boundary,  of  the  n  -  dimensional  parametric  x-space, 
the  equation  u  -  u  (x)  furnishes  an  n-dimensional  point  locus  in  the 
N-dimensional  j*J-  space  If  n  -  2  ,  this  is  a  surface  in 

3-dimensional  space, and  we  shall  denote  it  by  the  symbol  S  even 
when  n  is  greater  than  2.  On  denoting  by  R  *  R'  the  region,  plus 


its  boundary,  of  our  n  -  dimensional  x  -  space  over  which  x  varies, 
the  equation  of  S  is 

u  =  u  (x)  ;  x«c  R  +  R' 

We  shall  suppose  that  S  is  smooth,  i.e. ,  that  u  possesses,  over 
R  +  R’ ,  a  continuous  gradient  lxn  matrix  u^.  This  gradient 

matrix  plays  the  role  played  by  the  n  x  1  velocity  matrix  x{ 


in  problems  of  Type  1.  We  denote  by  X  the  N  x  1  matrix  I  J 

/X  \ 

and  by  z  the  (2n  ♦  1)  x  1  matrix  ,  and  we  term  the 

'  ux 

(2n  +  1)  -  dimensional  z-spare  the  state-space  of  our  problem  of 
Type  2  of  the  calculus  of  variations.  The  Lagrangian  function 

L  -  L  (z)  -  L  (X,  ux)  of  this  problem  is  a  function  of  z  = 

which  we  assume  to  be  a  continuous  function  of  x  and  a  continuously 
differentiable  function  of  u  and  u^  over  a  given  region  D  of 

our  (2n  ♦  1)  -  dimensional  state-space.  If  X  is  any  point  of  a  smooth 
n  -  dimensional  point  locus  S  in  our  N  -  dimensional  X  -  space, 


X  \ 


the  (2n  ♦  1)  x  1  matrix 


furnishes  a  point  in  our  (2n  +  1) 


\  */ 

dimensional  state  space,  and  we  term  the  point  locus  consisting  of  all 
these  points  the  image  P  ,  in  the  state-space,  of  S.  If  this  image 
P  is  covered  by  D,  the  integral 
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I=J  L  (X.  u  )  d  (x)  ;  d  (x)  =  d  (x1,  ...  ,  xn) 

R  ♦  R'  X 

exists  by  virtue  of  the  continuity  of  L  (z)  over  D  and  of  the 
smoothness  of  S  The  points  of  S  which  are  furnished  by  the 
boundary  R'  of  R  constitute  the  boundary  of  S ,  and  we  denote 
this  (n  -  1)  -  dimensional  boundary  of  S  oy  S'  »  and  our  problem 
of  Type  2  of  the  calculus  of  variations  has  two  parts: 

1)  The  determination  of  the  extremal  n  -  dimensional  point 
loci;  i.e. ,  the  point  loci  for  which  the  integral  I,  evaluated  for 
all  smooth  n  -  dimensional  point  loci  having  a  prescribed  (n  -  1)  - 
dimensional  boundary,  has  a  stationary  value. 

2)  The  determination  of  necessary,  or  of  sufficient,  conditions 
for  an  extremal  n  -  dimensional  point  locus  S  to  be  a  minimal 

n  -  dimensional  point  locus;  l.  e. ,  such  that  I  (S)  ^  I  (S)  if  S  is  any 
comparison  n  •  dimensional  point  locus  which  is  sufficiently  near  S. 
Note:  We  shall,  from  now  on,  refer  to  an  n-dimensional  point  locus 

IX  \ 

-  space  as  a  surface,even  when  n  >  2  . 

When  n  =  1  ,  the  distinction  between  problems  of  Type  1  and 
problems  of  Type  2  of  the  calculus  of  variations  disappears. 

In  order  to  obtain  the  Euler-  Lagrange  equation  for  our  problem 
of  Type  2  of  the  calculus  of  variations,  we  imagine  any  smooth  surface 
S  whose  equation  is 

u  =  u  (x)  ;  xcR»R' 
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and  whose  image  P  in  our  (2n  ♦  1)-  dimensional  state-space 
is  covered  by  D,as  imbedded  in  a  1  -  parameter  family  of 
smooth  comparison  surfaces  S  whose  equations  are 
u  -  u  (x)  ♦  s  }  (x)  ;  x  c  R  «■  R‘  ;  -  6<s^6 
vhere  f  (x)  is  continuously  differentiable  over  R  R'  and  is 
zero  over  R' ,  so  that  all  the  members  of  our  family  of  comparison 
surfaces  have  the  same  boundary  as  S  If  6  is  sufficiently  small, 
the  image  P  of  any  member  S  of  this  family  of  comparison 
surfaces  in  our  (2n  ♦  1)  -  dimensional  state  space  is  covered  by  D, 
and  the  integral 

I  (S )  -  f  L  (X.  u  )  d  (x) 

R  -  R' 


exists  and  is  a  differentiable  function  of  s  over  the  interval 
-  5  4  s  <  6  .  We  denote  by  6  I  the  differential  of  this  function  of  s  , 
evaluated  at  s  =  0,  and  we  term  6  I  the  variation  of  I.  Then 


6  I  /  Pu  6  u  '  6  u,  Lu  fd  ,,l) 

JR  ’  R'  ’  X  1 


where  iu  ds|  and  6  ux  -  d  s  fx  are  the  variations  of  u  and 

u  respectively,  d  s  being  an  arbitrary  number.  L  ,  which  is 
x  ’  x 

an  n  x  1  matrix,  plays,  for  problems  of  Type  2  of  the  calculus  of 
variations  the  role  played  by  the  lxn  Lagrangian  momentum  matrix 
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p  =  L  for  problems  of  Type  1 .  We  observe  that  fiu  =  (fiu)^ 
xt 

and  we  avail  ourselves  of  this  fact  to  transform,  under  the  hypothesis 

that  L  is  continuously  differentiable  over  B  ♦  R' ,  the  multiple 
ux 

integral  I  Ifiu  L  1  d  (x) ,  just  as,  when  treating  problems 

JR-R'  l  xJ 

of  Type  1 ,  we  used  the  rule  of  integration  by  parts  to  transform 
the  corresponding  integral  jpfi  (xt)j  dt.  The  rule  which  corre¬ 


sponds  for  multiple  integrals  to  the  rule  of  integration  by  parts 
for  simple  integrals  is  as  follows:  If  v  and  w  are  functions  of 
the  n  x  1  matrix  x  which  are  continuously  differentiable  over 
R  ♦  R'  ,  then 


[  v  w  .  d  (x)  -  f  vwd  (x2,  ...  ,  xn)  -  / 

Jo  .  n.  **  O '  F 


R  *  R  * 


v  .  wd(x) 
R  +  R’  * 


V  w  d  (x)  ■-/  vwd  (xl,  x3,... ,  xn)  -  / 

K  *  K  "'r'  R+R 


v  ,wd(x) 
t  x  *■ 


I  V  W  d  (x)  -  (-1)"'  *  /  vwd  (X1,..  ,  x"*1)  -  J  V  wd(x) 
JR.R-  *n  JR’  x 

/  w1  \ 

If  w  |  ;  jisanxl  matrix  function  of  the  n  x  1  matrix  x 


which  is  continuously  differentiable  over  R  ♦  R" ,  it  follows  that 
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[  v  div  w  d  (x)  -  J  v  w  d  S'  -  J  v  w  d  (x) 

Jr»  a,  JC'  R  +  R'  * 


R  -  R' 


where  divw,  the  divergence  of  w,  is  the  sum, 

w\  *  . . .  ♦  wn  ,  of  the  diagonal  elements  of  the  n  -  dimensional 
x1  xn 

matrix  w^,  and  w  d  S'  is  an  abbreviation  for 

d  (x2,  .  ..  ,  xn)wl  -  d(xl,x3,  ...  ,  xn)  w2  ♦  ...  ♦  (-l)n“1d(x1 _ xn_1)wn 

evaluated  over  R' .  The  integral  j  v  w  d  S'  is  known  as  the  integral, 

JS' 

over  the  boundary  S'  of  S  of  the  nxl  matrix  vw,  or  as  the 

flux,  through  S' ,  of  the  vector  which  is  furnished  by  the  nxl 

matrix  vw.  If  v  is  zero  over  R’,  we  obtain  the  simplified  rule 

I  v  div  w  d  (x)  -  -  I  v  w  d  (x) 

JR  +  R'  JR  ♦  R’ 

Setting  v  -  6u  (  w  -  L  ,  under  the  hypothesis  that  Lu  is 


continuously  differentiable  over  H  *  R' ,  we  obtain 

f  (6u )  L  d  (x)  -  -  [  (CudivL  )d(x) 
xu..  Jn  .  o'  u„ 


R  +R' 


R  ♦  R 


and  it  follows,  since  (6u)x  -  6ux ,  that 


6  I  -  f  L  -divL  5u  d(x) 

JR  ♦  R'' *  U  x*  » 
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The  same  argument  as  in  the  case  of  problems  of  Type  1  shows 

that  the  necessary  and  sufficient  condition  that  6  I  be  zero,  for 

all  allowable  choices  of  j  (x) ,  is  that  u  =  u  (x)  satisfy,  over 

R  +  R’,  the  partial  differential  equation 

L  -  div  L  -  0 
u  u 

x 

which  is  the  Euler- Lagrange  equation  for  our  problem  of  Type  2 

of  the  calculus  of  variations.  Indeed,  if  Ly  -  div  Ly  were 

x 

different  from  zero  at  a  single  point  of  R  +  R’  it  would,  by  virtue 

of  its  continuity,  be  different  from  zero  at  a  point  x*  say,  of  R 

and  hence  it  would  be,  again  by  virtue  of  its  continuity,  one-signed 

over  a  neighborhood  of  x*  which  is  covered  by  R.  Taking  f  (x) 

to  be  one-signed  over  this  neighborhood  and  zero  elsewhere,  6  I 

could  not  be  zero,  since  it  reduces  to  the  integral  of  the  product  by  ds 

of  the  function  (L  div  L  )  f  (x)  over  the  neighborhood  in  question. 
u  x 

and  this  function  is  one-signed  over  this  neighborhood.  This  proves 

the  necessity  of  the  equation  L  -  div  Ly  =0,  xcR  *R  ,  for 

x 

6  1  to  be  zero  for  all  allowable  choices  of  f  (x) ,  and  the  sufficiency 
of  this  condition  is  evident. 
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Example .  Denoting  by  A^  u  the  squared  magnitude,  ux  ux  ,  of 
the  gradient  1  x  n  matrix  ux  of  u : 


A.  u  =  u  u  *  =  (u  ,)2  +  •  •  •  +  (u  )2 

1  x  x  x1  xn 

o 

let  L  -  Aj  u  ♦  g  u  ,  where  g  is  a  given  function  of  x  which  is 

continuous  over  R  +  R' .  Then  L  =  2  ux  ,  so  that  div  «  2  Ag  u , 

ux  * 

where  A„u-u  ,  +u„  „  is  the  sum  of  the  diagonal 

2  x^x1  xnxn 

elements  of  the  n  -  dimensional  matrix 

Laplacian  of  u.  Since  Lu  -  2  g  u  ,  the  Euler- Lagrange  equation  is 


A2  u  *  g  u 


and  the  problem  of  determining  extremal  surfaces  of  the  Lagrangtan 
function  L  -  Aj  u  +  g  u  over  a  given  region  R  of  our 

n  -  dimensional  x-space  is  that  of  determining  a  solution  of  the 
second  order  partial  differential  equation  Aj  u  =  g  u  .which  takes 

assigned  values  over  the  boundary  R'  of  R. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  14 

Constrained  Problems;  Characteristic  Numbers 

Since  for  problems  of  Type  2  of  the  calculus  of  variations 
we  have  only  one  dependent  variable,  we  cannot,  as  we  did  when 
considering  problems  of  Type  1,  impose  a  constraint  of  the  form 

I  \ 

0  (X)  2  *  =0,  since  a  constraint  of  this  type  would  determine  u 

as  a  function  of  x  and  would  not  permit  us  to  vary  u.  We  may, 
however,  impose  a  weaker  constraint  of  the  form 

J  0  (X)  d  (x)  -  C  ;  0  continuous  and  not  identically  zero 

over  RtR',  where  C  is  some  given  constant.  On  differentiating 
this  relation  with  respect  to  s,  when  u  is  replaced  by  u  =  u  *  sf(x), 
and  then  setting  s  =  0 ,  we  obtain 

j  (<p  6  u)  a  (x)  -  0 

R  ¥  R'  U 

and  in  order  that  6  1,  where  1^  /  L  (X,  u  )  a  (x) ,  may  be  zero 

R 

for  all  allowable  choices  of  f(x),  it  is  necessary  and  sufficient  that 


138 


<  L  -  div  L  5u)  d  (x)  =  0  for  all  allowable  choices  of 
R  +  R"  U  V  ' 

6u  =  f  (x)  ds  which  respect  the  relation  J  («  6u)  d  (x)  -  0. 

R  +  R'  u 


It  is  evident,  then,  that  a  sufficient  condition  for  6  I  to  be  zero, 

for  ,  11  allowable  choices  of  f(x) ,  is  that  L  -  div  L  be  a  constant 

u  x 

multiple  of  $  and  the  following  argument  shows  that  this  sufficient 

condition  is  also  necessary.  If  A  is  any  constant,  the  statement 
that 


/  . 

j  L  -  div  L  )  fiu  >  d 

R  +  R’ 

|'  u  V  1 

choices  of  6u  which  respect  the  relation 
implies  that 


/  <♦„ 
n  .  n  t  u 


6u)  d  (x)  =  0 


U 


R  ♦  R' 

L  -  div  L  -  A*,,)  6u)  d  (x)  =  0  for  all 
u  ux  u  j 

J  (♦  6u)  d(x) 


allowable  choices  of  6u  which  respect  the  relation 


R  ♦  R 


,  u 


If  we  were  allowed  to  set  6u  -  L  -  div  L  -  A^u  ,  the  equation 

x 

I  ( L  -  div  L  -  A*  Uu>  d  (x)  =  0  or,  equivalently, 

K  +  R  x  '  I 

I  L  -  div  L  -  A  0  2  d  (x)  =  0  would  imply,  since 

J  .  U  u  u  1 

JR  ♦  R  x  ' 


L  -  div  L  -  A$  is,  by  hypothesis,  continuous  over  R  R’,  that 
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L  -  div  L  =  Ad  over  R  ♦  R’ .  In  order  to  respect  the  condition 
u  ux 

I  L  6u\d(x)  =  0 ,  A  must  be,  when  we  set  6u  =  Lu*  div  Lu~Adu 
R  +  RM  U  *  x 

the  quotient  of  the  integral  of  (L  -  div  L  )  0u  over  R  +  R'  by  the 

u  x 

integral  of  d^  over  R  ♦  R' .  There  remains  the  difficulty  that  we  are 

not  allowed  to  set  6u  =  L  -  div  L  -  A  du  without  the  assurance  that 

u  x 

L  -  div  L  -A*  is  zero  over  R',  but  we  avoid  this  difficulty  as 
U  ux  yu 

follows:  Let  R^  be  a  subdomain  of  R  which  is  such  that 

r*-*»r  +  R'  as  €  0  .  Then  the  integral  of  1  over  R  +  R'  -  R* 

tends  to  zero  with  €,and  we  set  6u  =  L  -  div  -Adu>  writh  the 

x 

previously  determined  value  of  A,  over  R*  j  whereas,  over 
R+R'-R^  ,  6u  is  any  continuous  function  of  x  which  is  zero 

over  R’.  Then  the  integral  of  (L  -  div  L  -Adu>  6u  over  is 

ux  c 

nonnegativr,  whereas  the  integral  of  this  samy  expression  over 
r  +  r>  .  r*  tends  to  zero  with  £  since  the  integrand  is  bounded 


over  R  +  R'  -  R*  .  Since  the  integral  of  (L  -  div  L  -A0  )  6u 
t  u  ru' 

over  R  +  R*  is,  by  hypothesis,  zero,  the  limit,  as  £-*D,  of  the 
integral  of  this  expression  over  ,R*  is  zei  ,  and  since  this  integral 
does  not  decrease  as  £— *>0,  its  integrand  being  nonnegative,  the 
value  of  the  integral  over  R*  must  be  ero  ;  this  implies  that 

L  -  div  L  =  A0  over  R*  regardless  of  the  value  of  £  , 
u  u  ru  c  • 

x  t 

It  follows  by  virtue  of  the  continuity  over  R  +  R'  of  each  of  the 

two  sides  of  this  equation  that  L  -  div  L  =  A0  over  R  -»  R‘ . 

u  u  u 

x 

1  2 

The  case  where  0  -  ,  u  ,  so  that  0  =  u ,  is  particularly 
«  u 

interesting.  If  L  does  not  involve  u  explicitly,  so  that  Ly  =  0  , 

the  partial  differential  equation  which  determines,  in  combination  with 
the  assigned  values  of  u  over  R',  the  extremal  surfaces  of  the 
constrained  problem  of  Type  2  of  the  calculus  of  variations: 


I 


r 

■J. 


R  *  R 


L  (x,  ux)  d(x) 


:  L  _.u 


d(x)  =  constant 


R  +  R' 


is 


div  L  -t  A  u  =  0 
u 

x 
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and  we  see  on  multiplying  this  equation,  by  u  and  integrating  the 

product  over  R  ♦  R'  that  A  is  the  negative  of  the  quotient  of  the 

integral  of  u  div  L  over  R  >  R'  by  the  given  value  of  J  u2  d(x). 

ux  R  ♦  R’ 

Since  u  =  0  over  R\  the  negative  of  the  integral  of  u  div  Lu  over 


R  +  R'  is  the  same  as  the  integral  over  R  +  R'  of  the  matrix 

r 

product  u  L  .  Taking  the  given  value  of  I  $  d(x)  to  be  1 
x  ux  R  ♦  R' 


so  that 


I 


1 :  ♦  R’ 


uZ  d(x)  =  2 ,  we  see  that 

^  ~  l  I  ^ux  Lu  ^  d 
L  R  «•  R’  *  U* 


(*) 


If,  in  particular,  L  =  ux  ux*  >  s°  0411  =  ux*  *  we 


have 


u  L  =  2L  (this  relation  being,  indeed,  valid  when  L  is  any 
x  u 

x 

homogeneous,  or  positively  homogeneous,  function  of  degree  2  of 
the  1  x  n  matrix,  ux).  Hence 


•  / 


L  d(x)  =  I 


R  +  R* 

We  shall  now  consider  in  some  detail  the  particular  constrained 
problem  of  Type  2  of  the  calculus  of  variations  in  which  u  is 
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assigned  the  value  zero  over  R' ,  and  L  *  5  ux  ux  '  Then 

div  L  =  Tr  u  .  =  A0  u  ,  and  the  extremal  surfaces  are  determined 
u%  x*x  ^ 

by  the  partial  differential  equation: 

u  ♦  A  u  =  0  ;  x  R  *  R' 

and  the  boundary  condition: 
u  -  0  ;  xeR' 

The  value  of  the  constant  A  ,  which  is  that  of  our  i  .tegral  I 

extended  over  the  extremal  surface,  is  notgiven.and  it  is  part  of  our 

problem  to  determine  it.  An  obvious,  but  trivial,  extremal  surface 

is  that  furnished  by  the  equation  u  =  0  ,  x  <=  R  ♦  R’  ■  and  for  this 

extremal  surface  W  0 .  If  there  exists  a  nontrivial  extremal 

surface,  i.e.,  one  for  which  u  is  not  identically  zero  over  R  -  R'  , 

the  corresponding  value  of  A  is  positive  (  since  u  is  not  constant 

over  R  ♦  R’ ,  its  value  over  R'  being  zero,  and  so  the  integral  of 

T  -  1  u  u  *  over  our  extremal  surface  is  positive).  We  term  any 
u  3  x  x 

value  of  A  ,  for  which  the  boundary -value  problem 
a  u  +  au  =  0,  x  c  R  ♦  R' ;  u  =  0 ,  xlR1 

it 

possesses  a  nontrivial  solution,  a  characteristic  number  o(  this 
boundary-value  problem.no  that  all  the  characteristic  numbers  ol  this 
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particular  boundary- value  problem,  i 1  any  such  exist,  are  positive. 

If  our  constrained  problem  of  Type  2  of  the  calculus  of 
variations  possesses  a  minimal  surface  S ,  the  corresponding 
boundary -value  problem  possesses  a  characteristic  number,  since 

the  constraint  i  I  u2  d(x)  =  1  prohibits  u  from  being 
1  R  ♦  R' 

identically  zero  over  R  ♦  R' .  Let  us  suppose  that  the  minimal 
surface  S  furnishes  an  absolute  minimum  of  the  integral  I  for 
all  functions  u  which  are  zero  over  R'  and  which  respect  the 


condition 


I 


d(x)  =  1 .  If  v  is  any  function  of  x  which 


R  ♦  R' 

is  continuous  over  R  ♦  R'  and  zero  over  R' ,  we  can  determine  the 
constant  multiplier  c  so  that  u  =  cv  respects  the  condition 

1  I  u2  d(x)  =  1 ,  and  so  i  j  (u*  u  )  d(x)  >  A,  or, 

2  R  ♦  R'  2  R  ♦  R’  X 

equivalently,  ^  J  (v^  v^)  d(x)  >  -y-  Aj 
R  ♦  R'  c 

1  f  2  1 

This  implies,  since  *  i  v  d(x)  =  — n~  ,  that 

2  JR*R’  C2 

I  (v*v  -  A.  v2)  d(x)  0 .  In  other  words,  A.  is  a  lower 
R+R'  X  X  1 
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bound  of  the  various  Rayleigh  quotients 


(v  *v  )  d(x) 

X  x 


R  ♦  R’ 


R  ♦  R 


,v  d(x) 


Example.  Let  R  be  the  circle  of  radius  a  with  center  at  the 
origin, and  set  A  j  =  k2  .  When  written  in  polar  coordinates  the 

equation  A2  u  ♦  Aj  u  =  0  appears,  on  the  assumption  that  u  is  a 

function  of  r  alone,  an  V  .  1  «r  •  k,2  u  »  0,  and  .he  only  solutions 

of  this  equation  which  are  finite  at  r  =  0  are  constant  multiples  of 
Jq  (kj  r)  where  JQ  is  the  Bessel  function  of  index  zero  of  the 

first  kind.  Thus  k  a  is  the  smallest  positive  zero,  2.4048,  to 

1  }  2 
4  decimal  places,  of  this  Bessel  function ,  so  that  a  =  5.  7832 

„2 

The  Rayleigh  quotient  for  v  =  cos  ^  is  the  product  of  ^2 


Wgjrdr*  J  (cos2^)rdr 


'  21 

=  f  *  (l-cos  ^)rdr  -  |  (1+cos^jrdr 

Jn  a  J0 


2  2 
l  *  4  hence  the  product  of  this  Rayleigh  quotient  by  a  is 

' 
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— w -  =  5.8304,  to  4  decimal  places,  this  product  being  greater 

4(»  -  4) 

than  the  square  of  the  least  positive  zero  of  Jq  by  less  than  S  units 
in  the  second  decimal  place. 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 


Lecture  15 

Multiple-Integral  Problems  Whose  Lagrangian  Functions  Involve 
Derivatives  of  Higher  Order  than  the  First 

We  now  suppose  that  L  involves,  in  addition  to  the  N  x  1 

matrix  x  I  and  the  1  x  n  matrix  uw  ,  the  n-dimensional 

u  I  * 

matrix  ux,x  .  This  matrix  is  symmetric, and  we  could,  if  we  so 

desired,  confine  our  attention  to  those  n  ^ — —  of  its  elements 

which  lie  on  and  above  its  diagonal.  The  argument  proceeds  more 
easily,  however,  if  we  write  L  as  a  symmetric  function  of  the 
symmetric  matrix  ux.x  as  follows;  Wherever  we  encounter  any 

nondiagonal  element  u  p  q  ,  P  A  q,  of  ux#x  ^  L,  we  replace 

it  bv  -  u  n  o  *  u  a  d  ,  and  we  define  the  n-dimensional  matrix 
2  xp  x4  xH  xF 

L  by  regarding  the  n  elements  of  u  -  ,  which  we  encounter 

u  .  *  * 

x*x 

in  L  when  thus  expressed  as  a  symmetric  function  of  ,  as 

independent  variables,  ignoring  the  relations  u^q  z  uxp  xq  ’  q  ^  p’ 
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which  express  the  symmetry  of  the  matrix  ux*x-  Then  the 
contribution  to  5L  from  the  elements  of  ux,x  is  the  trace,  i.  e. , 

the  sum  of  the  diagonal  elements,  of  the  product  of  the  two 

n-dimensional  matrices  L  and  6  ux>x  ,  a  typical  element 

ux*x 


of  this  sum  being 


L  ^  U  _  fl  I  Pi  *1  ^ .  n' 

uxpxq  xPxq 

Assuming  that  the  symmetric  n-dimensional  matrix 


x*x 


possesses  over  R  ♦  R  continuous  second  derivatives  with  respeci 
to  the  n  x  1  matrix  x  ,  we  may  Integrate  by  parts  each  of  the  term 


f  |l  6  u 

jR  >  R’  I  UxP  x<l  1 


p  xq 


d(x)  which  appear  in  6  I  due  to  the 


dependence  of  L  on  ux,x  .  Writing 


L  6  u  _ 

V  *<l  * 


-Li  5  u  ♦  I L 

u  p  q  xp  xq  *  u 
xF  x^ 


xp  xq'xq 


5  u 


xP 


I 


L 


I 


uxp  xq 


xq 


u 


p*q|*q 


X*'  X 


(l  |  »u 

\  UxPx"lxPxq 


we  have 


-  5u 


=  L 


6u 

p*q  xI 


X^X 


“  1 

/  (L  1  fiu\  ♦  1 

L 

xq 

il  uxpxqixq  'xP 

Uxpxq 

6u 


xW 


i- 


148 


and  the  integrals  of  the  first  two  terms  on  the  right  over  R  ♦  R’ 
may  be  replaced  by  integrals  over  R'  Since  5  u  is,  by 
hypothesis,  zero  over  R’ ,  the  second  of  these  two  integrals  over 
R*  is  zero.  Similarly,  the  first  of  the  two  integrals  over  R’  would 
be  zero  if  we  assumed  that  the  1  x  n  matrix  6  ux  were  zero 


over  R’ ,  but  since,  after  we  have  performed  the  integration  by 
parts,  we  have  to  sum  with  respect  to  p  and  q  ,  it  is  not  necessary, 
in  order  to  assure  that  this  integral  over  R'  be  zero,  to  make  such  a 
strong  assumption  concerning  the  1  x  n  matrix  6  u^  .  If  n  =  2, 

so  that  R'  is  a  plane  curve,  the  integral  with  which  we  are  concerned 
is  the  line  integral 


and  it  suffices  to  assume  that  the  integrand  of  this  line  integral  is  zero 
i.  e.  ,  that  the  product  of  the  2x1  matrix 

by  the  1  x  2  matrix,  (6u  ,  6u  ) ,  is  zero 

J 

over  R’.  When  n=3,  so  that  R'  is  a  surface  in  3- dimensional 


L  dy  -  L  dx 
u  u 

xx  xy 


L  dy  -  L  dx 
\  u  u 

\  yx  yy 


space,  the  integral  with  which  we  are  concerned  is  the  surface  integral 


L  (L»: 


flu  ♦  L  flu  ♦  L  flu  ]  d  (y,  z) 
x  u  y  u  z  1 
XX  yx  1  zx 


L  flu  ♦  L  flu  ♦  L  flu 
u  x  u  y  u_„  z 
xy  yy  zy 


L  flu  +  L  flu  ♦  L  flu 
x  u  y  u  z 
xz  yz  zz 


1  2  3 

x  =  x  ,  y  =  x  ,  z  =  x 


d  (z,  x) 
d(x,  y)l  ; 


and  it  suffices  to  assume  that  the  product  of  the  3x1  matrix 
/ Lu  d(y,  z)  ♦  Lu  d(z,  x)  ♦  Lu  d(x,  y)\ 


xx 


*y 


xz 


L  d(y,  z)  +  L  d(z,  x)  ♦  L  d(x,  y) 

uyx  yy  yz 

\ L  d(y,  z)  ♦  Lu  d(z,  x)  ♦  Lu  d(x.  y)  j 

\  u  zx  zy  zz 


by  the  1x3  matrix  (flu.,  flu, ,  flu  )  is  zero  over  R' ;  and  so  on. 

Making  these  assumptions,  namely, 

1)  u  is  prescribed  over  R' 

2)  The  product  of  the  n  x  1  matrix  L  dS  by  the 

x*x 

variation  of  the  gradient  1  x  n  matrix  ux  of  u  is  zero  over  R' , 


we  have 
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61 


^R+R' 


z 

P,Q 

1 


-  7  II  !  ♦  L  )  5u 

xV  ^ 


d  (x) 


and  the  Euler- Lagrange  equation  is 


I  lL„ 


?■  1  I  xVlxV  i’1 


*■  L  *  0  . 

"JxJ  U 


We  have  previously  denoted  the  simple  sum,  Z  I  L 


j=i  »  “xJ'xi 


by  div  L  ,  and  we  now  denote  the  double  sum,  Z  |LU  I  » 
ux  p.ql  xViW* 

1 

by  (div)2  L  In  this  notation,  the  Euler- Lagrange  equation  of 

ux*x 

of  our  problem  of  Type  2  of  the  calculus  of  variations  appears  as 

(div)2  L  -  div  L  ♦  L  =  0  . 
u  ^  u  u 

x*x  X 

An  important  example  of  the  theory  is  that  for  which 

L  (A„u)2  -  2  g  u,  where  A„u  =  Tr  u  #  and  g  is  continuous  over  R  *  RV 

Here  L  =  2  A-  u  cl  ,  where  E  is  the  n-dimensional  identity 
u  „  2  n  n 
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matrix  and  L  is  the  zero  n  x  1  matrix.  Since  (div)*  A2u  is 
ux 


V  =  p^UxPxPxPxP  *  2  p£q 

1 


u  ,  l.  e. 

xPxPx^x^ 


the  symbolic  square 


of  A2u  ,  the  Euler- Lagrange  equation  is 
A4u  =  g 

a  partial  differential  equation  of  the  fourth  order;  when  g  =  0 ,  this 
equation  is  known  as  the  biharmonic  equation.  Since  L  is  a 

multiple  of  the  n-dimensional  identity  matrix,  the  second  of  our  two 
assumptions  listed  above  states  that  the  product  of  the  n  x  1  matrix 
dS  by  6  u  is  zero  over  R'  or,  equivalently,  that  the  normal 

directional  derivative  ur  of  u  prescribed  over  R’;  indeed 
dS  is  a  multiple  of  u x<  and  6  ux  *  \  6  so  that  uxux, 

2 

is  prescribed  over  S  and  uxux*  un  ■ 

By  the  same  argument  as  in  the  case  where  L  was  a  function  of 
X  and  u  ,  not  involving  derivatives  of  u  of  higher  order  than  the 

first,  we  see  that  if  our  problem  of  Type  2  of  the  calculus  of 
variations  is  subiect  to  a  constraint  of  the  type,  /  ^  d  (x)  =  constant’ 
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then  the  Euler- Lagrange  equation  is 

(div)2  L  -divL  +  L  =  X  0 

u  .  u  u  u 

x*x  X 

where  X  is  an  undetermined  constant.  In  particular,  when  the 

f  2 

constraint  is  u  d  (x)  =  constant,  this  equation  is 

JR  +  R' 

(div)2  L  -  div  L  +  L  =  2Xu 
'  u  „  u  u 

X*X  X 

2 

and  this  reduces,  when  L  =  (Ag  u)  ,  to 

A.u  =  Xu 
4 

The  values  of  X  for  which  this  partial  differential  equation  possesses 
solutions,  not  identically  zero,  over  R  +  R’  which  vanish,  together 
with  their  normal  derivatives,  over  R’  ,  are  the  characteristic 
numbers  of  this  boundary- value  problem,  the  corresponding  functions 
u  being  the  characteristic  functions  of  the  boundary -value  problem. 
It  is  easy  to  see  that  all  these  characteristic  numbers  are  positive; 
indeed,  on  multiplying  the  equation,  A4u  =  Xu ,  by  u  and  integrating 
the  resulting  equation  over  R  +  R’ ,  we  obtain 
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/ 

JR+R' 


d  (x)  =  u  A.u  d(x) 

^R+R' 


*  1.  I  “  “  P.P.P.P  d  *  *  2!  I ..."  VxPx«x1dW 


p=l  'R+R'  xpxpxpxp  p<q  "R+R' 

1 


n  r 

-T  u  u  d(x)  -  2  I  u  nx  d(x) 

£1  jR+R’  xp  xpxPxp  W  pVR.R'  xp  xPxqxq 


n  r 

1 


_ 

since  u  =  0  over  R' .  Since  £  u  dSp  =  0  over  R',  we  have 

p-1  xp 

t  J  u  u _ d(x)  =  -  £  j  (u  „  J2d(x)  and 


p-1  R+R'  xp  xpxpxp 


W" XPXP' 


p=l  R+R' 


Y  f  uu  d  (x)  =  -  y  |  U  u  d  (x)  and 

p<q  R+R’  xp  xPxqxq  p<  q  R+R’  xpxp  xqxq 

1  1 


so 


a/  u2  d  (x)  *  /  <  £  (u  )2  +  2  V  u 

JR+R*  R*-R'  xP*P  p<q 


1 


u  >d  (x) 
p<q  xpxp  xqxqf 

1 


-J 


R*R 


(^2  u)  d  (x)  . 


In  other  words:  each  characteristic  number  X  is  the  quotient  of  the 
integral  jR+Rt  (&2  u)2  d  (x)  by  the  integral  /R^R,  u2  d  (x) ,  where 
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u  is  a  nontrivial  solution,  over  R  -  R  ,  of  the  partial  differential 
equation  a4  u  =  X  u  ,  which  vanishes,  togethe-  with  its  normal 
derivative,  over  R’  . 

Let  us  now  assume  the  existence  of  a  function  uj  =  u  (xl ,  not 

identically  zero,  which  possesses  over  R*R' ,  a  continuous  Laplacian 
A2u  ,  and  which,  finally,  is  such  that  in  the  class  of  all  such  functions 


J  (A.u)  d(x) 

R+R'  ^ 

u  ,  the  quotient  Q  =  ,  - -  has  an  absolute  minimum, 

d<*> 


m 


^  say,  when  u  =  Uj  .  Then,  u  being  any  member  of  this  class  of 


2  2 1 

functions,  \  (&2  u)  -  m^  u  jd  (x) 


^  0  ,  the  equality  holding 


when  u  =  Uj  .  Writing  u  =  Uj  ♦  sf  ,  the  integral  last  written  becomes 

a  differentiable  function  of  s  whose  derivative,  at  s  =  0,  must  be  zero 
so  that 


A2  Uj  A2  j-  -  mj  Uj  f|  d  (x)  =  0  . 


Since  the  normal  derivative  of  j-  vanishes  over  R  , 
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I  (4  u  4  ,fUW=  -  L  j  (u  U  )  f  p}“  w  ,  “d 
JR+R'  212  pi  R*R'\  2  1  vP  x  1 

this  is,  in  turn,  since  ^  =  0  over  R'  ,  -  f  (A.u.)  f  )  ^  (x) . 

^R*R'I  I 

f  i 

t\(^x  -  m  jUj  )  f  ^  d  (x)  =  0 ,  and  this  implies,  since  f 


Hence 


HR 


is  arbitrary  tA«.ept  that  it  possesses  a  continuous  Laplacian 

over  R  ♦  R  and  vanishes,  together  with  its  normal  derivative  over 

R' ,  that  A4  Uj  -  mj  Uj  =0  over  R  +  R  so  that  is  one  of 


the  characteristic  numbers  of  our  boundary  value  problem  If  X  is 
any  characteristic  number,  the  quotient  Q  for  any  characteristic 
function  which  corresponds  to  X  is  X,  and  so  X  ^  nij  ;  thus  m} 


is  the  least  characteristic  number,  and  we  may  use  the  Rayleigh- 
quotient  method  or  the  Rayleigh- Ritz  modification  of  this  method  to 
approximate  this  least  characteristic  number. 

In  order  to  deal  with  characteristic  numbers  other  than  the  least, 
let  us  assume  the  existence  of  a  function  u2  not  identically  zero 
possessing  a  continuous  Laplacian  over  R  -  R  and  vanishing,  together 
with  its  normal  derivative,  over  R’,  and  which  is  such  that 
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u.  u_d  (x)  =  0  and,  finally,  which  has  the  property  that,  in  the 
R+R'  1  1 

class  of  all  such  functions, 

|  (A,  u)2  d  (x' 

R+-R’  * 

Q  =  — , - -  has  an  absolute  minimum,  nrij,  say, 

>R*R-u2  d  (X> 

when  u  =  u2-  Then,  u  being  any  member  of  this  class  of  functions, 


R-R 


(^2u) 


m2u 


2\ 


d  (x)  >  0 


the  equality  being  valid  when 


u  =  u2-  If  i  is  any  function  possessing  a  continuous  Laplacian 
and  vanishing,  together  with  its  normal  derivative,  over  R’ ,  we  may 


determine  the  constant  k  such  that  u  g  d  (x)  =  0 ,  where 

J  r%  .  n  ♦  * 


R+R’ 


g  f  •  k  u.  ,  k  being,  in  fact,  the  negative  of  the  quotient  of  the 


integral 

R*R' 


r  2 

u  f  d  (x )  by  the  integral  u  d  (x)  . 

VR+R' 


Writ  mg  u  u„  *  sg,  so  that  i  u  d  (x)  =  0  ,  no  matter  what 

1  R-R' 


the  valu*  iif  s.  we  find,  as  before,  that 


Writing  g  =  -f  ♦  k  u, ,  the  coefficient  of  k  in  J  \A2u2*2g  '  m2u2gi  d(x) 
1  R*H  I  ) 


is  easily  seen  to  be  zero;  indeed,  this  coefficient  is 


R  +  R 


(A2U2A2U1  '  m2U2Ul)  d  (x)>which  is  the  Same  aS 


(A,u,A,u.)  d(x),  since  u  u 

t+-R'  2  2  2  1  R*R 


u„u  d(x)  =  0  ;  finally, 


(A.u-A-u,)  d(x)  sVid(x).  since  u2  vanishes, 

Vr  2  2  1  R*R’ 

together  with  its  normal  derivative  over  R'  ,  and  this  last  integral  is 

zero  ,  since  A^  =  miui  over  R  +  R  '  Hence 

I  A0u0A,f  -  m,u  A )  d  (x)  =  0  ,  and  the  same  argument  as  before 

W  222 

shows  that  this  implies  that  A^  =  m2u2  ,  so  that  m2  is  a  characteristic 
number.  If  X  is  any  characteristic  number  >  nij  ,  with  the  associated 


characteristic  function  u,  the  relation  J  ui 

R+R 


indeed  ,  <A2ulA2u)  d(x)  =  J  ,(ulA4u)  d(x)  =  X 
R+R’  R+R 


u,u  d  (x)  =  0  must  hold; 


/  ulu 

-  T%  .  O  »  1 


and  similarly,  i  (A^  A  u)dW-  <VVl)d(x)  =  ml  d(x)> 

R+R.  1  R+R  K+K 
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Since  X  >  m,  the  equality  XJ  u.ud(x)  =  m.  j  u  u 
1  R*R'  1  1  R+R’ 


a  y\> 


implies  that  I  u  ud(x)=G.  Hence. the  quotient  Q  for  u  >  m2 
n*R'  1 

so  that  X  ^  m2  ,smce  Q  =  X  when  u  is  a  characteristic  function 

associated  with  the  characteristic  number  X.  Proceeding  in  this  way, 
we  assume  the  existence  of  a  function  u^ ,  not  identically  zero,  and 

possessing  the  following  properties: 

1)  ^  u3  exists  and  is  continuous  over  R*R'. 


2)  u„  and  (uj  vanish  over  R’  . 

3  on 

3)  Both  u  u„  d  (x)  and  j  u,u,  d  (x)  *  0 

Vr'  1  3  R*R  1  J 


/  ,(A2u 

4)  Q  - 


)2  d  (x) 


—  has,  m  the  class  of  all  functions 


'R-R'  u“  d  (x) 


possessing  properties  1J,  2),  and  3) ,  an  absolute  minimum  rn^ , 

say,  when  u  u3  .  Then  m3  is  a  characteristic  number  of  our  boundary- 
value  problem,  and  if  X  is  any  characteristic  number  ^  X2»  ^  ^  nn^  . 
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Lectures  on  Applied  Mathematics 
The  Calculus  of  Variations 
Lecture  16 

The  Courant  Maximum-Minimum  Principle 

We  have  seen  that  the  least  characteristic  number, 
of  the  boundary-vaiue  problem 

A^u  ♦  Au  0  ,  x  R+R '  ;  u  0,  x._R 

is  a  lower  bound  of  the  various  Rayleigh  quotients 

R  (v)  -  (  A.v  d(x)  /  j  v2  d(x)  ;  AjV  =  vx  vx, 
k+R'  1  <  R*R' 

formed  for  all  functions  v  which  possess  continuous  gradients  over 
R*F'  and  which  vanish  over  R'  .  If  u^  is  a  characteristic  function 

corresponding  to  j  A.u.  d(x)  -j  ul^2Ul  ’  since 

1  R*R'  R*R' 

u,  is  zero  ner  R',  and  so  *^iu1  ^(x)  *1  U1  ' 

1  RfR’  1  1  R*R' 

Hence,  R  (uj)  A}  ,so  that  Aj  is  the  greatest  lower  bound  of  the 
various  Rayleigh  quotients  Rfv)  formed  for  all  functions  v  which 
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possess  continuous  gradients  over  R+R'  and  which  vanish  over  R'  . 
By  the  same  argument  as  that  used  at  the  end  of  the  preceding  lecture 
when  discussing  the  boundary-value  problem 

A.u+Au  =  0,  x  -  R+R’  ;  u  =  0,u  =  0,  x  R’ 

4  n 

we  see  that  the  next  larger  characteristic  number  Aj  is  the 

greatest  lower  bound  of  the  Rayleigh  quotients  R(v)  formed  for  all 

functions  v  which  possess  continuous  gradients  over  R+R’  and 

vanish  over  R*  ,  and  which,  in  addition  are  such  that 

u.v  d(x)  =  0.  Similarly,  the  characteristic  number  A 
JR+R'  1  J 

which  is  the  next  larger  to  A2  is  the  greatest  lower  bound  of  the 

Rayleigh  quotients  R(v)  formed  for  all  functions  v  which  possess 

continuous  gradients  over  R+R'  and  vanish  over  R' ,  and  which,  in 

addition  are  such  that  both  of  the  relations  '  u.v  d(x)  =  0  and 

R+R' 

u0v  d(x)  =  0  ,  where  u.  and  u„  are  characteristic  functions 
R+R'  2  1 

corresponding,  respectively,  to  the  characteristic  numbers  Aj  and 
A2  ,  hold;  and  so  on. 

The  various  characteristic  functions  Uj  ,  u2,  ...  , 
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corresponding,  respectively,  to  the  different  characteristic  numbers 
Xj  ,  Xj,  ...  satisfy  the  relations: 

I  •  U- u.  d(x)  =  0  ;  u.u_  d(x)  =  0  ;  .  .  . 

R+P’  1  2  R+R'  J 

j  u2u3  d(x)  =  0  ;  .  .  . 

R+R' 

and  there  is  no  lack  of  generality,  since  each  of  these  characteristic 
functions  may  be  multiplied  by  any  constant,  other  than  zero,  in 

2  I  2 

assuming  that  /  u,  d(x)  =  1  ;  /  u.  d  (x)  =  1 . 

R+R’  R+R' 

1  2 

and  we  shall  do  this.  Let  v  =  c  Uj  (x)  +  c  u2  (x)  be  any  linear 

j  2  12  2  2 

combination  of  u.  and  u_  ;  then  J  v  d(x)  -  (c  )  +  (c  ) 

R+R' 

To  determine  the  Rayleigh  quotient  for  v  we  observe  that 
AjV  =  (cV^jUj  +  2c1c2  (Uj)x  (u2)x,  +  (c2)2AjU2  . 

The  integral  of  (uj  (u„)  „  over  R+R'  is  the  negative  of  the 
integral  of  UjA2u2  over  R+R',  since  Uj  vanishes  over  R' , 
and  this  is  the  product  by  X2  of  the  integral  of  UjU2  over  R+R'  . 
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We  know  that  this  latter  integral  is  zero,  and  so 

A.v  d(x)  =  (c1)2  A.u.  d(x)*(c2)2J  A.u2d(x) 

1  1  R+R'  1  R+R’ 

=  (cl)2Xj  ♦  (c2)2  X2,  since 

u,2  d(x)  =  1,1  u  2  d(x)  =  1 
JR+R'  1  JR*R’  1 

(cl)\(c2)\ 

Hence  R  (v)=  — pg - g-g -  ,  which  is  ^  X2, since  <  X2  . 

(c  )  ♦  (c  ) 

Similarly,  the  Rayleigh  quotient  for  any  linear  combination  of  , 
u  ,  u  is  ^  X,  ,  and  so  on.  The  useful  feature  of  this  result  is 
that  all  the  characteristic  numbers  X2  ,  X^,  ...  ,  other  than  the 
least,  X j  ,  play  the  role  of  upper  bounds  rather  than,  as  previously, 
lower  bounds. 

Let  us  now  consider  any  function  w  which  is  integrable 
over  R+R'  ,  it  being  not  required  that  w  possess  a  continuous 
gradient  over  R+R'  nor  that  it  vanish  over  R'  ,  and  let  us  form 
the  Rayleigh  quotients  for  those  functions  v  which  possess  con¬ 
tinuous  gradients  over  R+R'  and  vanish  over  R’ ,  and,  in  addition, 
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are  such  that  j  w  v  d(x)  =  0  .  Among  such  functions  v  is  a 

R+R' 

12  1 
linear  combination,  c  Uj  +  c  Uj,  of  u^  and  Uj,  the  ratio  of  c 

to  c2  being  determined  by  the  equation  c*  I  wu-d(x)  +  c  /  wugdM^O, 

J R+R'  1  R+R' 

r  {  1 

unless  both  I  wu.d(x)  and  ]  wu9d(x)  =  0 ,  in  which  case  c  and 
jR+R'  1  R+R* 

c2  may  be  arbitrarily  chosen.  The  minimum  of  these  Rayleigh  quotients, 
which  minimum  depends  on  w,  is,  accordingly,  ^  Xg  no  matter  what 
the  function  w.  We  have  already  seen  that  if  w  =  u^,  this  minimum 

is  Xj,  the  corresponding  choice  of  v  being  Ug-  Thus,  is  the 

maximum,  as  w  is  varied,  of  the  minimum  of  R  (v) ,  formed  for  all 

functions  v  possessing  continuous  gradients  over  R+R'  and  vanishing 

over  R'  ,  and  in  addition, such  that  w  v  d(x)  =  0  where  w  is 

R+R' 

any  function  which  is  integrable  over  R+R’  . 

Similarly,  if  Wj  ,  wg .  w  j  is  any  set  of  p  linearly 

independent  functions  which  are  integrable  over  R+R' ,  p  =  2,  3 . 

the  minimum  of  the  Rayleigh  quotients  R  (v)  formed  for  all  functions 
v  possessing  continuous  gradients  over  R+R'  and  vanishing  over  R', 


ir?4 


and  in  addition,  such  that  the  p-1  relations 


l  w  v  d(x)  =  0  ;  .  .  . 

R+R' 


R+R 


w  .  v  d(x)  =  0 
f  P~  1 


hold,  is  <  X  .and  X  is  the  maximum  of  this  minimum  as  the 
P’  P 


p-1  functions  w.  ,  .  .  .  ,  w  ,  are  varied.  This  result  is  known 
r  1  p-l 


as  the  maximum-minimum  principle  of  Courant. 

We  now  avail  ourselves  of  the  Courant  maximum-minimum 
principle  to  show  how  to  obtain,  by  means  of  the  Rayleigh-Ritz 
method,  approximations  not  only  to  the  least  characteristic  number 
of  the  various  boundary-value  problems  we  have  encountered  but 
to  the  larger  characteristic  numbers  as  well.  Let  p  be  any  positive 

integer,  and  let  Vj .  v  be  p  linearly  independent  functions, 

each  of  which  possesses  a  continuous  gradient  over  R+R'  and 
vanishes  over  R'.  If  v  is  the  1  x  n  matrix  (v^,  .  .  .  >vp)>  an<^ 
c  is  any  constant  p  x  1  matrix,  the  matrix  product  vc 
possesses  a  continuous  gradient  over  R+R'  and  vanishes  over  R'. 

The  Rayleigh  quotient  for  this  matrix  product  is  the  quotient  of  two 


homogeneous  quadratic  forms  in  the  p  x  1  matrix  c,  and  if  we  denote 
the  matrices  of  the  quadratic  forms  in  the  numerator  and  denominator 
of  this  quotient  by  M  and  N,  respectively,  we  have 
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R  (vc) 


c  ♦  M  c 
c  •  N  c 


We  seek  now  not  merely  the  aosolute  minimum  of  R  (vc) ,  but  the 
various  stationary  values  of  R  (vc) .  Multiplication  of  the  p  x  1 
matrix  c  by  any  nonzero  constant  leaves  R  (vc)  unaffected, 
and  we  may  determine,  since  c*  N  c  >0,  this  multiplicative 
constant  so  that  c  •  N  c  =  1  .  Thus  the  stationary  values  of  R  (vc) 
are  those  of  the  quadratic  form  c  •  M  c  ,  the  p  x  1  matrix  c 
being  supposed  subjected  to  the  constraint  c  •  N  c  =  1  .  In  other 
words,  at  any  stationary  point  of  R  (vc)  ,  the  linear  form  dc'  Me 
in  d  c  must  be  zero  for  every  p  x  1  matrix  d  c  which  satisfies 
the  relation  dc'Nr  0  .  This  implies  the  existence  of  a  number  U 
such  that  Mr  ^Nc,  J  being  any  one  of  the  zeros  o'  “,e  polynomial 
in  l a.  of  degree  p  ,  det  (M  -uNI.  What  we  propose  to  show  is  that 
not  only  is  the  least  of  these  zeros,  j  say,  ?  the  next 

greater  of  these  zeros,  ~2  say,  is  >2  and  so  on  to  the  greatest 

of  the  zeros,  ,  which  is  >  A^  .  Our  first  step  is  to  show  that 

there  is  no  lack  of  generality  in  taking  N  to  be-  the  p-dimensional 
identity  matrix.  Indeed,  if  we  write  v’  =  vC,  where  C  is  any  non¬ 
singular  p-dimensional  matrix,  the  relation  Vc’  -  v  c  implies, 
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since  the  p  elements  of  v  are  linearly  independent,  that  c  -  C  c’, 
so  that 


R  (vc)  =  Cc'*,  |^r'-~r  ,  where  M'  =  C*M  C  and  N'  =  C*N  C 

Then  det  (M'  -  pL  N’)  =  (det  C)2  det  (M  -  ^  N),  so  that  det  (M’  -  ^  N’) 
is  zero  when,  and  only  when,  det  (M  -  N)  =  0 .  Since  N  is 
positively  definite,  we  can  determine  a  rotation  matrix  R  such 
that  R*N  R  is  a  diagonal  matrix  D  with  positive  elements. 

Writing  C  =  R  D‘  1  2  ,  N'  =  C*N  C  is  the  p-dimensional  identity 
matrix  Ep  .  Thus,  det  (M'  -  U  Ep)  =  0  ,  so  that  p  { .  Up 

are  the  characteristic  numbers  of  the  p-dimensional  matrix  M'  ,  and 
our  task  is  to  show  that  *  ^  ^  \  ^  1 •  >  P  •  The  fact  that  N 


is  the  p-dimensional  identity  matrix  Ep  implies  that 

i  2 

v'  v.'  d(x)  =0  if  j  *  k  ,  while  [  (O  d(x)  =  1  ,  k  =  1, - P, 

R>R'  J  k  R*R’ 

and  these  relations  remain  valid  if  v'  is  subjected  to  the  transforma¬ 
tion  v'— >v"--vR,  where  R  is  any  p-dimensional  rotation  matrix. 
Under  this  transformation  M'- - >M"  =  R*  M'  R  and  R  may  be  so 
chosen  that  M"  is  the  diagonal  matrix  whose  diagonal  elements  are 

U  a  .  From  the  Courant  maximum- minimum  principle  we 

1  ’  ’  ^p 
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know  that,  if  2-^  r  ^  p,  is  the  maximum,  as  the  functions 

w* . wr are  varied,  of  the  minima  of  the  various  Rayleigh 

quotients  R  (u)  formed  for  those  functions  u  possessing 
continuous  gradients  over  R+R'  and  vanishing  over  R'  which 
are  such  that 

I  w*  u  d(x)  =  0  ;  .  .  .  J  wr  *  u  d(x)  =  0 
R+R'  R+R' 

To  this  class  of  functions  u  belong  those  members  of  the  family 

v  c  =  0 ,  for  which  ar^c  =  0,  ...  ,  ar  *  c  =  0 ,  where 

r  f 

or  *  -  J  w*  v  d(x) ,  ...  ,  ar1"  *  -  j  wr  *v  d(x) 

R+R'  R+R' 

Hence  the  minimum  of  R  (u)  <  /lj(c*)2  •*•...  ♦■Lip  (cP)^»  where 

the  p  x  1  matrix  c  is  subjected  not  only  to  the  quadratic  constraint 
(c1)2*...  ♦  (cP)2  but  also  to  the  r-1  linear  constraints  ar1c  =  0,  . 
ar  l  c  =  0  . 

Treating  first  the  case  r  =  2  ,  let  us  examine  the  minimum 

of  M.(c *)2  +  .  .  .  +  LL  (cp)2  where  c*  c  =  1  and  or*  c  =  0 ,  a  t  'ing 

1  p 

a  given  p  x  1  matrix.  At  any  stationary  point  of  (c1)2*  ...  ^Mp(cP)2, 
subject  to  these  constraints,  jl .  cJ  =  B  cJ  +  5  a* ,  where  B  and  6  are 
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undetermined  multipliers,  and,  since  c*c  =  1,  c  •  a  =  0  , 


=  ^  P.  (cj)2  is  the  v 


1,2 


j=l 


p.  (c  )  is  the  value  of  the  quadratic  form  /ij  (c  )  *  . 


+  /ip(cP)  at  the  stationary  point  in  question.  Denoting  by  D  the 


diagonal  p-dimensional  matrix  whose  diagonal  elements  are 


. /ip,  we  see,  on  eliminating  c  and  5  from  the 

equations  (D-B)c-5a  =  0,  a*  c  =  0  that  the  minimum  of 
ri^1)2  -*■  .  .  .  +  /ip  (cP)2  ,  subject  to  the  constraints  c  •  c  =  1  . 

a*  c  =  0,  is  the  least  zero  of  the  polynomial,  of  degree  p-1  ,  in  fi: 


P  (B)  -  det 


D-BEp  o, 


[  a*  0 

1  2 

The  value  of  this  polynomial  when  B  is  (a  )  (/ig*  ■  •  • 

2  2 

( p  -  P  j )  ,  its  value  when  P  -  P  ^  is  (a  )  (.ij  -  P^)  .  ■  . 

(/ip  '^2)'  ^d  30  on-  Thus  P(/ij)^0,  P  (/^l  ^  0 . The 

numbers  U\,  ■■■  ,  p  are  the  stationary  values  of 

P^  (c1)2  ♦  ■  ■  ■  ♦  P  (cP) 2  when  the  p  x  1  matrix  c  is  subjected 

to  the  single  constraint  c  *c  1  ,  and  so  we  have  the  following 
result: 
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When  the  p  x  1  matrix  c  is  subjected  not  only  to  the 

quadratic  constraint  c  *  c  =  1  but  also  to  an  arbitrary  linear 

constraint  a*  c  =  0,  the  stationary  values  U' .  •  ■  •  .  )J.' 

1  P-1 

of  the  quadratic  form  (c*)^  +  .  .  .  ♦  jl  (c*5)  interlace  the 
previous  stationary  values  ■  ,  /ip: 

'T1s.U2<^2<  ••  ^  ^p-1  **  ^p-l  ^  ^p 

In  particular,  the  minimum  of  R  (u)  ,  which  is  <  U\,  is  ^2‘ 

Since  this  holds,  no  matter  what  the  function  w\  it  follows  that 

A,.  .  which  is  the  maximum  of  this  minimum  as  w  is  varied,  is 
a  ’ 

^  ^  2  ‘ 

When  r  -  3  we  have  two  linear  constraints  <ij*  c  =  0, 

a  •  c  =  0  instead  of  merely  one.  We  use  the  first  of  these  two  linear 
2 

constraints  to  write  our  Rayleigh  quotient  R(vc)  as  a  quadratic  form 
in  a  (p-l)xl,  rather  than  a  pxl,  matrix  c’ .  The  quadratic 
constraint  c  •  c  =  1  appears  as  c**  A  c'  =  1 ,  and  this  may  be 
written  as  c"*  c"  =  1  by  setting  c'  =  B  c",  where  B  is  an  appro¬ 
priately  chosen  nonsingular  (p-1)  -  dimensional  matrix.  Hence, 
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L- 


by  the  result  for  r  =  2,  the  neu  stationary  values,  —  j  ,  • 

2  ’  p‘2  in  numb€r»  interlace  the  immediately  preceding 


stationary  values,  — 'j 


-  ^  p-l 


*4  'l  ^  ^1 


2  ^  U  2  ^  ‘ 


^'p-2  ^  u  p-2  ^  k‘  p- 1 


Hence  slnce  '2  3  ■  Continuing  in  this  way 

we  obtain  s,  si  ^  ,  j  1  ,  ■  ■  ■  ,  P  • 

Example  We  saw  in  Lecture  10,  when  discussing  the  boundary- 
value  problem:  *  A*  -  0,  -1  v.  t  <  1  .  x  (-1  0  x  (11,  that 

2 


f  _  ,  ,  3  35  *  146  *  Us 

the  minimum  01  H  (s)  -  ^  ,  2T  »  6s~L  s‘ 


is  a  good  approxima¬ 


tion  to.  being  onlv  slightly  greater  than,  the  least  characteristic 
2 

number.  ,  of  this  problem.  The  two  stationary  values  of  R  (s)  , 

4 

one  of  which  is  the  absolute  minimum  and  the  other  if  which  is  the 
absolute  maximum  of  R  (s) ,  are  the  zeros  of  the  quadratic  polynomial 

in 


det 


I  105  -  42  (J.  21  -  6/J 

21  -  6  /i  33  -  2U 


1=  48  (/i  -  28^  -  63) 
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The  smaller  of  these  two  zeros,  2.4674  to  4  decimal  places,  is  a 

2 

good  approximation  to  ,  which  is  also  2.4674  to  4  decimal 

places.  The  larger  of  the  two  zeros,  25.  5326  to  4  decimal  places, 
is  an  approximation,  in  excess,  to  the  second  characteristic  number, 
2 

9  Z _ =  22.  2066  ,  but  the  approximation  is  not  nearly  so  good  as  that 

4 

furnished  by  the  smaller  of  the  two  zeros. 
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